IMPRINTING IN PLANTS TO CONTROL GENE EXPRESSION

BACKGROUND OF THE INVENTION 
Genomic imprinting is an epigenetic modification of a specific parental
chromosome in the gamete or zygote that leads to monoallelic or differential
expression of the two alleles of a gene in somatic cells of the offspring. The
general assumption is that maternally- and paternally-transmitted genes are
expressed at equivalent levels in progeny. However, non-equivalent expression of
the maternally- and paternally-transmitted genes was described in 1970 and 1983
in plants (maize) and mammals, respectively (Alleman M, Plant Mol Biol. (2000)
43:147-61). This phenomenon, named imprinting, is defined as epigenetic gene
silencing that is set in the male or female germ lines, resulting in a differential
expression of maternally- and paternally-derived alleles. Imprinting affects various
essential cellular and developmental processes, including intercellular signaling,
RNA processing, cell cycle control, and promotion or inhibition of cellular division
and growth.

Many mammalian genes influenced by imprinting have been identified. The
first deduction of imprinting at the single gene level involved a transgenic c-myc
gene that showed dependence of its expression on paternal inheritance. The silent
maternally inherited copy was methylated (Swain et al. (1987) Cell 50:719-727).

The increased attention to imprinting in mammals is due to the recognition
of its importance during development and its role in causing several human
genetic diseases. Abnormalities of a single gene can affect imprinting of a
proximate genomic region and disrupt multiple disease-causing genes, the
phenotype depending upon the parental origin of the mutated gene. Imprinted loci
have been implicated in disease. For example, disrupted imprinting of a locus is
one of the causes of Prader-Willi syndrome (PWS) and Angelman syndrome
(AS), which involve mental retardation. PWS also causes obesity, and AS
involves gross motor disturbances. Each disorder can be caused by
parental-origin specific uniparental disomy (Nicholls et al. (1989) Nature
342:281-285; Knoll et al. (1990) Am. J. Hum. Genet. 47:149-155) or chromosomal
deletions (Knoll et al. (1989) Am. J. Hum. Genet. 47:149-155; Mattei et al. (1984)
Hum. Genet. 66:313-334).

Genomic imprinting has been implicated in cancer. The work has
demonstrated that a balance of maternal and paternal chromosomes is required. A
relative imbalance leads to neoplastic growth, and the type of neoplasm depends
upon whether there is a maternal or paternal genetic excess. Tumors associated
with imprinting include the two embryonic tumors, hydatidiform mole and complete
ovarian teratoma, familial paraganglioma or glomus tumor, hepatoblastoma
(Rainier et al. (1995) Cancer Res. 55:1836-1838; Li et al. (1995) Oncogene
11:221-229), rhabdomyosarcoma (Zhan et al. (1994) J. Clin. Invest. 94:445-448),
and Ewing's sarcoma (Zhan et al. (1995a) Oncogene 11:2503-2507). Loss of
imprinting (LOI) of IGF2 and H19 has also now been found in many adult tumors,
including uterine (Vu et al. (1995) J. Clin. Endocrinol. Metab. 80:1670-1676),
cervical (Doucrasy et al. (1996) Oncogene 12:423-430), esophageal (Hibi et al.
(1996) Cancer Res. 56:480-482), prostate (Jarrard et al. (1995) Clin. Cancer Res.
1:1471-1478), lung cancer (Kondo et al. (1995) Oncogene 10:1193-1198),
choriocarcinoma ([...] et al. (1995) Nat. Genet. 9:109-110), germ cell
tumors (Van Gurp et al. (1994) J. Natl. Cancer Inst. 86:1070-1075), BWS
(Steenman et al. (1994) Nature Genet. 7:433-439; Weksberg et al. (1993) Nature
Genet. 5:143-150), and Wilms tumor (Ogawa et al. (1993) Nature Genet. 5:408-412).
In the case of familial paraganglioma, the transmitting parent is the father
(Van der Mey et al. (1989) Lancet 2:1291-1294). The gene has recently been
localized to 11q22.3-q23 (Heutink et al. (1994) Eur. J. Hum. Genet. 2:148-158).

In angiosperm plants, imprinting is postulated to be essential for endosperm
development. In Arabidopsis, the MEA gene regulates cell proliferation by exerting
a gametophytic maternal control during seed development. Seeds derived from
embryo sacs carrying a mutant mea-1 allele abort after delayed morphogenesis
with excessive cell proliferation in the embryo and reduced free nuclear divisions
in the endosperm. The mutant mea seeds are able, at a low frequency, to initiate
endosperm development, seed coat differentiation, and fruit maturation in the
absence of fertilization. See Vielle-Calzada et al. (1999) Genes & Development
13:2971-2982. The mea mutation affects an imprinted gene expressed maternally
in cells of the female gametophyte and after fertilization only from maternally
inherited MEA alleles. Paternally inherited MEA alleles are transcriptionally silent
in both the young embryo and endosperm.

A consequence of imprinting is the requirement of a 2:1 ratio of maternal to
paternal genomes in the endosperm (Haig and Westoby 1991, Am. Nat. 134:147-155).
Thus imprinting plays a significant role in the proper development of seed in
cereal crops.

Abnormal imprinting has been studied in plants by analysis of gene 
expression. Methods are needed in the art to identify imprinted genes in plants, to 
identify genes involved in endosperm development, and to manipulate gene 
sequences to affect imprinting.
BRIEF SUMMARY OF THE INVENTION
Compositions and methods for identifying imprinting and genes regulated
by imprinting are provided. The methods involve an analysis of the nucleotide
sequence and the identification of CpG islands. At least two islands are involved
in imprinting. Thus, genes can be identified that are differentially expressed based
on parental inheritance. In this manner, the methods are useful for determining
the propensity of a gene to be influenced by imprinting. Such analysis involves
determining the pattern of imprinting for cells of interest.

It is further recognized that DNA constructs can be created which show
differential expression depending upon the parent of origin. To silence a
paternally inherited allele, at least two CpG islands are utilized in the construct.

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 shows the massively parallel signature sequencing (MPSS)
analysis of ZmFIE1 expression in embryo and endosperm. The graph represents
a distribution of the 17-mer tags (GATCTAGTGTGTGGCTG) in the endosperm
and embryo mRNAs generated by MPSS. The recognition site of the restriction
enzyme DpnII, used to generate tags, is GATC. A tag sequence is derived from
the ZmFie1 EST (Accession No. AY061964) positioned 112 nt upstream from the
polyA tail. The vertical axis represents the frequency of the tags as particles per
million (PPM) molecules sequenced on the microbeads. The horizontal axis
represents stages of kernel development starting with unfertilized ovules (point
"0"), and 8, 12, 21, 25, and 35 days after pollination (DAP). Endosperm and
embryos were dissected from kernels. Note that embryo tissues were not
dissected from 8 DAP kernels. Squares indicate endosperm; triangles indicate
embryos.
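
The tag counting described for Figure 1 can be illustrated with a minimal Python sketch. It assumes, as is conventional for MPSS but not stated above, that the signature is read from the DpnII (GATC) site closest to the polyA tail. The function names and the toy transcript are hypothetical and are not part of the disclosed method.

```python
from typing import Optional


def mpss_signature(cdna: str, site: str = "GATC", tag_len: int = 17) -> Optional[str]:
    """Return the tag_len-mer starting at the 3'-most `site`, or None if absent.

    `cdna` is the transcript in 5'->3' orientation without the polyA tail.
    Assumption: the signature starts at the GATC site nearest the 3' end.
    """
    seq = cdna.upper()
    if len(seq) < tag_len:
        return None
    # Restrict the search so that a full-length tag still fits after the site.
    pos = seq.rfind(site, 0, len(seq) - tag_len + len(site))
    return None if pos == -1 else seq[pos:pos + tag_len]


def tags_per_million(tag_count: int, total_tags: int) -> float:
    """Normalize a raw tag count to a per-million frequency (PPM)."""
    if total_tags <= 0:
        raise ValueError("total_tags must be positive")
    return tag_count * 1_000_000 / total_tags


# Toy transcript (hypothetical) that embeds the 17-mer tag cited for Figure 1.
toy_cdna = "AAAA" + "GATCTAGTGTGTGGCTG" + "CCCC"
signature = mpss_signature(toy_cdna)
```

In practice `total_tags` would be the number of signatures sequenced in a given library, so that frequencies from different developmental stages can be compared on one axis.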

Figure 2 shows the pattern of paternal and maternal ZmFie1 allele
expression in developing kernels. The graphs represent a size-dependent
separation of the RT-PCR DNA fragments by the WAVE HPLC System. The larger
fragments have a longer retention time on the DNASEP cartridge, which results in
an accurate quantitative separation of the complex fragment mixture. Total RNA
was isolated from 15 DAP kernels of selfed Mo17 and B73 lines and their
reciprocal crosses. RT-PCR was performed with primers positioned around 12 nt
deletions at 3' UTR in Mo17 background (2A, 2B). The anonymous EST was used
as a control for the expression of both maternal and paternal alleles in the same
samples of RNA (2C).

Figure 3 shows that ZmFie1 Mo17 and B73 alleles are polymorphic by the
MITE insertion at 3' UTR. The position of a common forward primer F (exon 11)
and the genotype-specific reverse primers (3' UTR) are shown by arrows. DNA
sequence of the MITE insertion into 3' UTR of the ZmFie2 B73 allele is shown in
3B. The target site duplication is boxed. The 14 nt terminal inverted repeats are
marked by arrowheads.

Figure 4 shows the genomic structure of the ZmFie loci. 12 kb genomic
segments of the ZmFie1 (A) and ZmFie2 (B) regions are shown. The predicted
start and stop codons of ZmFie coding regions are indicated by ATG and TGA.
The positions of nucleotides are relative to the translation start codon ATG. Exons
are shown as tall vertical boxes, untranslated regions as shorter boxes, and
introns as connecting double lines. The putative transcription and translation start
sites are shown as bent arrows. Regions with homology to retrotransposons are
stippled. The direct repeats positioned upstream of ZmFie2 are marked by large
arrows.

Figure 5 shows the 5' upstream and coding sequence for the ZmFie1 gene
sequence.

Figure 6 shows the 5' upstream and coding sequence for the ZmFie2 gene
sequence.

Figure 7 shows the distribution of the CpG and CpNpG methylation sites 
along the ZmFie genomic sequences. The graphs present the number of CpG or 
CpNpG sites per 100 nt. The start and stop codons are indicated by ATG and 
TGA. The CpG islands are marked by filled rectangles.

Figure 8 shows a phylogenetic tree of plant FIE proteins. 
Figure 9 shows the distribution of HpaII restriction sites across the ZmFIE1
and ZmFIE2 genomic sequences.

Figure 10 is a table of primers designed around clusters of HpaII sites to
monitor cytosine methylation.
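
As an illustration of how clusters of HpaII sites such as those referred to for Figures 9 and 10 might be located in a genomic sequence, the following Python sketch lists HpaII recognition sites (CCGG) and groups neighboring sites. The clustering thresholds (`max_gap`, `min_sites`) and the function names are hypothetical choices made for this sketch; the text does not specify how clusters were defined.

```python
import re
from typing import List, Tuple


def hpaii_sites(seq: str) -> List[int]:
    """Return 0-based start positions of every CCGG (HpaII) site in `seq`."""
    return [m.start() for m in re.finditer(r"(?=CCGG)", seq.upper())]


def cluster_sites(positions: List[int], max_gap: int = 50,
                  min_sites: int = 3) -> List[Tuple[int, int]]:
    """Group sorted site positions into clusters.

    Consecutive sites no more than `max_gap` nt apart belong to one cluster;
    clusters with fewer than `min_sites` sites are discarded. Both thresholds
    are illustrative assumptions. Returns (first_site, last_site) pairs.
    """
    clusters: List[Tuple[int, int]] = []
    current: List[int] = []
    for p in positions:
        if current and p - current[-1] > max_gap:
            if len(current) >= min_sites:
                clusters.append((current[0], current[-1]))
            current = []
        current.append(p)
    if len(current) >= min_sites:
        clusters.append((current[0], current[-1]))
    return clusters


# Toy sequence (hypothetical): three nearby sites, then one isolated site.
toy = "CCGGAACCGGTTCCGG" + "A" * 100 + "CCGG"
sites = hpaii_sites(toy)
clusters = cluster_sites(sites)
```

Primers flanking each reported interval could then be designed so that the amplicon spans several sites whose cleavage depends on methylation status.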

Figure 11 shows single nucleotide polymorphisms (SNPs) present in exon
1 of B73 and Mo17 inbred lines.

DETAILED DESCRIPTION OF THE INVENTION 
Imprinting has been observed in eukaryotic cells of plants and mammals
(Yoder and Bestor (1996) Biol. Chem. 377(10):605-610). In humans and other
mammals, normal imprinting underlies several fundamental cellular and
developmental processes; thus, abnormal imprinting patterns are implicated in a
wide variety of catastrophic human diseases. "Imprinting" is defined as an
epigenetic modification of a specific parental allele of a gene, or the chromosome
on which it resides, in the gamete or zygote, leading to differential expression of
the two alleles in somatic cells of the offspring. That is, genomic imprinting is an
epigenetic chromosomal modification in the germ line that leads to preferential
expression of one of the two parental alleles in a parent-of-origin-specific manner.
"Normal pattern of imprinting" means preferential expression of a single parental
allele of an imprinted gene and/or preferential methylation of a single parental
allele of an imprinted gene. "Loss of imprinting" or "LOI" means loss of a normal
pattern of imprinting, i.e., the loss of preferential expression of a single parental
allele of an imprinted gene and/or the loss of methylation of a single parental allele
of an imprinted gene. LOI is exhibited by a variety of abnormal expression
patterns. Such patterns include but are not limited to: equal expression of both
alleles; significant (>5%) expression of the normally silent allele when the normal
case is complete silencing of one allele; epigenetic silencing of the normally
expressed copy of an imprinted gene; the absence of methylation of both alleles
and/or the methylation of both alleles where the normal case is methylation of a
single allele.
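
The ">5%" criterion above can be made concrete with a short Python sketch. It assumes that the percentage is taken relative to the total signal from both alleles (the text does not say whether the reference is the total or the expressed allele alone), and that allele-specific signals come from some quantitative assay such as peak areas. The function names are hypothetical.

```python
def silent_allele_fraction(expressed_signal: float, silent_signal: float) -> float:
    """Fraction of total allelic signal contributed by the normally silent allele."""
    total = expressed_signal + silent_signal
    if total <= 0:
        raise ValueError("total allelic signal must be positive")
    return silent_signal / total


def shows_loi(expressed_signal: float, silent_signal: float,
              threshold: float = 0.05) -> bool:
    """True if the normally silent allele exceeds `threshold` of total signal.

    Assumption: the 5% cutoff is applied to the share of total expression.
    """
    return silent_allele_fraction(expressed_signal, silent_signal) > threshold


# Hypothetical peak areas for the normally expressed and normally silent alleles.
normal_case = shows_loi(expressed_signal=98.0, silent_signal=2.0)
loi_case = shows_loi(expressed_signal=90.0, silent_signal=10.0)
```

This covers only the expression-based patterns listed above; methylation-based patterns would need a separate readout.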

Imprinting is a developmental phenomenon wherein a gene in a gamete or
zygote is modified such that preferential expression of a single parental allele
occurs in the offspring. It has been theorized that "CpG islands" present within the
gene are subject to methylation, which causes repression of one allele (Stoger et
al. (1993) Cell 73:61-71). CpG islands are defined as sequences of 200 or more
base pairs with a GC content greater than 0.5 and an observed-to-expected CpG
dinucleotide content greater than 0.6 (Gardiner-Garden and Frommer (1987) J.
Mol. Biol. 196:261-282). Allele-specific methylation of CpG islands is a feature of
the inactive X chromosome (Yen et al. (1984) Proc. Natl. Acad. Sci. USA 81:1759-
1763) and imprinted genes including H19, Snrpn, and Igf2r (Brandeis et al. (1993)
EMBO J. 12:3669-3677; Shemer et al. (1997) Proc. Natl. Acad. Sci. USA 94:10267-
10272; Wutz et al. (1997) Nature 389:745-749). Analysis of orthologous genomic
domains of approximately 1 Mb in mouse and human identified nine conserved
imprinted genes; in eight of these, two or more conserved CpG islands were found
upstream of or within the gene. In contrast, six non-imprinted genes within the
same region were associated with at most one CpG island (Onyango et al. (2000)
Genome Research 10:1697-1710).
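
The CpG island criteria quoted above (at least 200 bp, GC content greater than 0.5, observed-to-expected CpG ratio greater than 0.6) can be applied with a sliding window, as in the following Python sketch. The observed-to-expected ratio is computed as (number of CpG x window length) / (number of C x number of G). The window step, the merging of overlapping windows, and the function names are assumptions of this sketch, not a description of the procedure actually used.

```python
from typing import List, Tuple


def cpg_stats(window: str) -> Tuple[float, float]:
    """Return (GC fraction, observed/expected CpG ratio) for one window."""
    w = window.upper()
    n = len(w)
    c, g = w.count("C"), w.count("G")
    cpg = w.count("CG")
    gc = (c + g) / n if n else 0.0
    obs_exp = cpg * n / (c * g) if c and g else 0.0
    return gc, obs_exp


def find_cpg_islands(seq: str, window: int = 200, step: int = 1,
                     min_gc: float = 0.5, min_oe: float = 0.6) -> List[Tuple[int, int]]:
    """Return merged (start, end) intervals, end exclusive, of qualifying windows.

    Each interval is the union of overlapping windows that pass both
    thresholds; it is not trimmed to the exact island boundaries.
    """
    hits: List[Tuple[int, int]] = []
    for start in range(0, len(seq) - window + 1, step):
        gc, oe = cpg_stats(seq[start:start + window])
        if gc > min_gc and oe > min_oe:
            if hits and start <= hits[-1][1]:
                hits[-1] = (hits[-1][0], start + window)  # extend previous hit
            else:
                hits.append((start, start + window))
    return hits


# Toy sequence (hypothetical): a 200 bp CG-rich block between AT-rich flanks.
toy = "AT" * 150 + "CG" * 100 + "AT" * 150
islands = find_cpg_islands(toy)
```

Counting how many such intervals fall upstream of or within a gene would give the "two or more islands" indicator discussed above.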

The present invention has identified CpG islands in plants and attributes
differential expression of imprinted plant genes to CpG islands. Accordingly, the
methods of the invention encompass the identification of imprinted plant genes by
determining the presence of CpG islands. Where sequence information is
available for a plant, the sequence can be searched for GC rich regions and
further testing can be done to establish the location of CpG islands.

Methods for the determination of the pattern of imprinting are known in the
art. It is recognized that the methods may vary depending on the gene to be
analyzed. Generally, in methods for assaying allele-specific gene expression,
RNA is reverse transcribed with reverse transcriptase, and then PCR is performed
with PCR primers that span a site within an exon where that site is polymorphic
(i.e., normally variable in the population), and this analysis is performed on an
individual that is heterozygous (i.e., informative) for the polymorphism. One then
uses any of a number of detection schemes to determine whether one or both
alleles are expressed. Methods for the assessment of gene expression,
allele-specific gene expression, and DNA methylation are encompassed. Additionally,
direct approaches to identifying novel imprinted genes include: positional cloning
efforts aimed at identifying imprinted genes near other known imprinted genes
(Barlow et al. (1991) Nature 349:84-87); techniques comparing gene expression
(Kuroiwa et al. (1996) Nat. Genet. 12:186-190); and restriction landmark genome
scanning (Nagai et al. (1995) Biochem. Biophys. Res. Commun. 213:258-265).
See also Rainier et al. (1993) Nature 362:747-749, which teaches the
assessment of allele-specific expression of IGF2 and H19 by reverse-transcribing
RNA and amplifying cDNA by PCR using new primers that permit a single round
rather than nested PCR; Matsuoka et al. (1996) Proc. Natl. Acad. Sci. USA
93:3026-3030, which teaches the identification of a transcribed polymorphism in
p57KIP2; Thompson et al. (1996) Cancer Research 56:5723-5727, which teaches
determination of mRNA levels by RPA and RT-PCR analysis of allele-specific
expression of p57KIP2; and Lee et al. (1997) Nature Genetics 15:181-185, which
teaches RT-PCR SSCP analysis of two polymorphic sites. Such disclosures are
herein incorporated by reference.

Direct approaches developed to identify novel imprinted genes include:
positional cloning, which identifies imprinted genes near other known imprinted
genes (Barlow et al. (1991) Nature 349:84-87); comparing gene expression in
parthenogenetic embryos to that of normal embryos (Kuroiwa et al. (1996) Nat.
Genet. 12:186-190); and restriction landmark genome scanning (Nagai et al.
(1995) Biochem. Biophys. Res. Commun. 213:258-265). The last approach
comprises analysis of clonality in tumors by assessing DNA methylation near a
heterozygous polymorphic site (Vogelstein et al. (1985) Science 227:642-645).

As noted above, the distribution of CpG islands within genes can be used as
a predictive tool for genes regulated by imprinting. The imprinted genes identified
in plants to date are important components of the regulation of endosperm size
and growth. Thus, the methods of the invention can be used to identify genes
involved in endosperm development. In particular, the invention can be used as a
predictive tool for plant genes, dicot and monocot genes, particularly maize genes,
that are regulated by imprinting.

It is also recognized that the CpG islands of the invention may be used to
silence paternally transmitted genes. In this manner, DNA constructs comprising
at least two CpG islands will be operably linked with a coding sequence and a
promoter that is expressed in plants.

WO 03/078580 PCT/US03/07552

A number of promoters can be used in the practice of the invention. The
promoters can be selected based on the desired outcome. The nucleic acids can
be combined with constitutive, tissue-preferred, or other promoters for expression
in plants.

Such constitutive promoters include, for example, the core promoter of the
Rsyn7 promoter and other constitutive promoters disclosed in WO 99/43838 and
U.S. Patent No. 6,072,050; the core CaMV 35S promoter (Odell et al. (1985)
Nature 313:810-812); rice actin (McElroy et al. (1990) Plant Cell 2:163-171);
ubiquitin (Christensen et al. (1989) Plant Mol. Biol. 12:619-632 and Christensen
et al. (1992) Plant Mol. Biol. 18:675-689); pEMU (Last et al. (1991) Theor. Appl.
Genet. 81:581-588); MAS (Velten et al. (1984) EMBO J. 3:2723-2730); ALS
promoter (U.S. Patent No. 5,659,026), and the like. Other constitutive promoters
include, for example, U.S. Patent Nos. 5,608,149; 5,608,144; 5,604,121;
5,569,597; 5,466,785; 5,399,680; 5,268,463; 5,608,142; and 6,177,611.

Chemically-regulated promoters can be used to modulate the expression of
a gene in a plant through the application of an exogenous chemical regulator.
Depending upon the objective, the promoter may be a chemical-inducible
promoter, where application of the chemical induces gene expression, or a
chemical-repressible promoter, where application of the chemical represses gene
expression. Chemical-inducible promoters are known in the art and include, but
are not limited to, the maize In2-2 promoter, which is activated by
benzenesulfonamide herbicide safeners, the maize GST promoter, which is
activated by hydrophobic electrophilic compounds that are used as pre-emergent
herbicides, and the tobacco PR-1a promoter, which is activated by salicylic acid.
Other chemical-regulated promoters of interest include steroid-responsive
promoters (see, for example, the glucocorticoid-inducible promoter in Schena et
al. (1991) Proc. Natl. Acad. Sci. USA 88:10421-10425 and McNellis et al. (1998)
Plant J. 14(2):247-257) and tetracycline-inducible and tetracycline-repressible
promoters (see, for example, Gatz et al. (1991) Mol. Gen. Genet. 227:229-237,
and U.S. Patent Nos. 5,814,618 and 5,789,156), herein incorporated by reference.

Tissue-preferred promoters can be utilized to target enhanced expression
within a particular plant tissue. Tissue-preferred promoters include Yamamoto et
al. (1997) Plant J. 12(2):255-265; Kawamata et al. (1997) Plant Cell Physiol.
38(7):792-803; Hansen et al. (1997) Mol. Gen. Genet. 254(3):337-343; Russell et
al. (1997) Transgenic Res. 6(2):157-168; Rinehart et al. (1996) Plant Physiol.
112(3):1331-1341; Van Camp et al. (1996) Plant Physiol. 112(2):525-535;
Canevascini et al. (1996) Plant Physiol. 112(2):513-524; Yamamoto et al. (1994)
Plant Cell Physiol. 35(5):773-778; Lam (1994) Results Probl. Cell Differ. 20:181-
196; Orozco et al. (1993) Plant Mol. Biol. 23(6):1129-1138; Matsuoka et al. (1993)
Proc. Natl. Acad. Sci. USA 90(20):9586-9590; and Guevara-Garcia et al. (1993)
Plant J. 4(3):495-505. Such promoters can be modified, if necessary, for weak
expression.

"Seed-preferred" promoters include both "seed-specific" promoters (those
promoters active during seed development such as promoters of seed storage
proteins) as well as "seed-germinating" promoters (those promoters active during
seed germination). See Thompson et al. (1989) BioEssays 10:108, herein
incorporated by reference.

Examples include, for dicotyledonous plants, a bean β-phaseolin promoter, a
napin promoter, a β-conglycinin promoter, a cruciferin promoter, and a soybean
lectin promoter. For monocotyledonous plants, promoters useful in the practice of
the invention include, but are not limited to, cZ19B1 (maize 19 kDa zein), milps
(myo-inositol-1-phosphate synthase), celA (cellulose synthase) (see WO
00/11177, herein incorporated by reference), a maize 15 kD zein promoter, a 22
kD zein promoter, a 27 kD γ-zein promoter (such as gzw64A promoter, see
Genbank Accession #S78780), a waxy promoter, a shrunken-1 promoter, a
globulin 1 promoter (see Genbank Accession #L22344), an ltp2 promoter (Kalla
et al., Plant Journal 6:849-860 (1994); U.S. Patent 5,525,716), cim1 promoter
(U.S. Patent 6,225,529), maize end1 and end2 promoters (see U.S. patent
applications 09/383,543, filed August 26, 1999, and 10/310,191, filed December 4,
2002), and the shrunken-2 promoter. See also U.S. patents 6,407,315 and
6,403,862. However, other promoters useful in the practice of the invention are
known to those of skill in the art, such as nucellain promoter (see C. Linnestad et
al., Plant Physiol. 118:1169-80 (1998)), kn1 promoter (see S. Hake and N. Ori,
B8: Interactions and Intersections in Plant Pathways, Coeur d'Alene, Idaho,
Keystone Symposia, February 8-14, 1999, at 27), and F3.7 promoter
(Baszczynski et al., Maydica 42:189-201 (1997)). Spatially acting promoters such
as glb1, an embryo-preferred promoter; or gamma zein, an endosperm-preferred
promoter; or BETL1 (see G. Hueros et al., Plant Physiology 121:1143-1152
(1999)), are particularly useful. The use of temporally acting promoters is also
contemplated by this invention. Promoters that act from 0-25 days after
pollination (DAP) are preferred, as are those acting from 4-21, 4-12, or 8-12 DAP.
In this regard, promoters such as cim1 and ltp2 are preferred. Particularly
preferred promoters include maize zag2.1 (GenBank Accession X80206), maize
zap (see U.S. Provisional Patent Application 60/364,065), maize ckx1-2 promoter
(see U.S. Patent Publication 2002-0152500 A1), maize end2 (see U.S. Patent
6,528,704, and also U.S. Patent Application 10/310,191, filed December 4, 2002),
and maize lec1 (see U.S. Patent Application 09/718,754, filed December 27, 2002).

Transformation protocols as well as protocols for introducing nucleotide
sequences into plants may vary depending on the type of plant or plant cell, i.e.,
monocot or dicot, targeted for transformation. Suitable methods of introducing
nucleotide sequences into plant cells and subsequent insertion into the plant
genome include microinjection (Crossway et al. (1986) Biotechniques 4:320-334),
electroporation (Riggs et al. (1986) Proc. Natl. Acad. Sci. USA 83:5602-5606),
Agrobacterium-mediated transformation (Townsend et al., U.S. Patent No.
5,563,055; Zhao et al., U.S. Patent No. 5,981,840), direct gene transfer
(Paszkowski et al. (1984) EMBO J. 3:2717-2722), and ballistic particle
acceleration (see, for example, Sanford et al., U.S. Patent No. 4,945,050; Tomes
et al., U.S. Patent No. 5,879,918; Tomes et al., U.S. Patent No. 5,886,244; Bidney
et al., U.S. Patent No. 5,932,782; Tomes et al. (1995) "Direct DNA Transfer into
Intact Plant Cells via Microprojectile Bombardment," in Plant Cell, Tissue, and
Organ Culture: Fundamental Methods, ed. Gamborg and Phillips (Springer-Verlag,
Berlin); McCabe et al. (1988) Biotechnology 6:923-926); and Lec1 transformation
(WO 00/28058). Also see Weissinger et al. (1988) Ann. Rev. Genet. 22:421-477;
Sanford et al. (1987) Particulate Science and Technology 5:27-37 (onion);
Christou et al. (1988) Plant Physiol. 87:671-674 (soybean); McCabe et al. (1988)
Bio/Technology 6:923-926 (soybean); Finer and McMullen (1991) In Vitro Cell
Dev. Biol. 27P:175-182 (soybean); Singh et al. (1998) Theor. Appl. Genet. 96:319-
324 (soybean); Datta et al. (1990) Biotechnology 8:736-740 (rice); Klein et al.
(1988) Proc. Natl. Acad. Sci. USA 85:4305-4309 (maize); Klein et al. (1988)
Biotechnology 6:559-563 (maize); Tomes, U.S. Patent No. 5,240,855; Buising et
al., U.S. Patent Nos. 5,322,783 and 5,324,646; Tomes et al. (1995) "Direct DNA
Transfer into Intact Plant Cells via Microprojectile Bombardment," in Plant Cell,
Tissue, and Organ Culture: Fundamental Methods, ed. Gamborg (Springer-Verlag,
Berlin) (maize); Klein et al. (1988) Plant Physiol. 91:440-444 (maize); Fromm et al.
(1990) Biotechnology 8:833-839 (maize); Hooykaas-Van Slogteren et al. (1984)
Nature (London) 311:763-764; Bowen et al., U.S. Patent No. 5,736,369 (cereals);
Bytebier et al. (1987) Proc. Natl. Acad. Sci. USA 84:5345-5349 (Liliaceae); De
Wet et al. (1985) in The Experimental Manipulation of Ovule Tissues, ed.
Chapman et al. (Longman, New York), pp. 197-209 (pollen); Kaeppler et al. (1990)
Plant Cell Reports 9:415-418 and Kaeppler et al. (1992) Theor. Appl. Genet.
84:560-566 (whisker-mediated transformation); D'Halluin et al. (1992) Plant Cell
4:1495-1505 (electroporation); Li et al. (1993) Plant Cell Reports 12:250-255 and
Christou and Ford (1995) Annals of Botany 75:407-413 (rice); Osjoda et al. (1996)
Nature Biotechnology 14:745-750 (maize via Agrobacterium tumefaciens); all of
which are herein incorporated by reference.

The cells that have been transformed may be grown into plants in
accordance with conventional ways. See, for example, McCormick et al. (1986)
Plant Cell Reports 5:81-84. These plants may then be grown, and either
pollinated with the same transformed strain or different strains, and resulting
plants having desired expression of the subject phenotypic characteristic may be
identified. Two or more generations may be grown to ensure that the desired
expression of the subject phenotypic characteristic is stably maintained and
inherited and then seeds harvested to ensure that desired expression of the
subject phenotypic characteristic has been achieved.

The following examples are offered by way of illustration, not by way of 
limitation. 

EXPERIMENTAL
Introduction 

A fundamental problem in biology is to understand how fertilization initiates
reproductive development. In flowering plants, the female gametophyte, or
embryo sac, is composed of egg, central, synergid, and antipodal cells. Double
fertilization triggers development of the egg into a diploid embryo and
development of the central cell into a triploid endosperm. In sexually-reproducing
plants, the embryo sac never develops into seed without fertilization. In asexually-
reproducing apomictic plants, the egg cell develops parthenogenetically without
fertilization to produce the embryo, but in many species the endosperm
development may still require fertilization (non-autonomous apomicts) (Grimanelli
et al. (2001) Trends Genet. 17(10):597-604).

A number of mutants that initiate fertilization independent seed (FIS)
development have been isolated in Arabidopsis (Ohad et al. (1996) Proc. Natl.
Acad. Sci. USA 93(11):5319-5324; Chaudhury et al. (1997) Annu. Rev. Cell Dev.
Biol. 17:677-699). These mutants uncouple seed development from the
fertilization process and display some characteristics of apomixis, such as
autonomous endosperm development. A mutational approach has revealed three
genes with similar FIS phenotypes: FIS1/MEDEA, which is related to the
Polycomb group (PcG) protein EZ (Enhancer of zeste) of Drosophila (Grossniklaus
et al. (1998) Science 280:446-450; Luo et al. (1999) Proc. Natl. Acad. Sci. USA
94(8):4223-4228); FIS2, which is a C2H2 zinc finger transcriptional regulator that
may have a similar function to the Hunchback protein of the Drosophila PcG complex
(Luo et al. (1999) Proc. Natl. Acad. Sci. USA 94(8):4223-4228); and FIS3/FIE,
which is a homologue of the PcG protein ESC (extra sex combs) (Ohad et al.
(1999) Plant Cell 11:407-416). Polycomb group proteins are conserved among
eukaryotes and are involved in the repression of homeotic genes during early
development in flies and mammals. One could speculate that FIS genes define a
PcG-like complex in plants that suppresses the development of the endosperm in
the absence of fertilization (Grossniklaus et al. (1998) Science 280(5362):446-450;
Luo et al. (1999) Proc. Natl. Acad. Sci. USA 94(8):4223-4228; Ohad et al. (1999)
Plant Cell 11:407-416).

The Arabidopsis model provides candidate genes for revealing similar
pathways in other plants. A search for homologues in a proprietary maize EST
(Expressed Sequence Tag) database identified two maize genes, ZmFie1 and
ZmFie2 (see WO 01/16325, herein incorporated by reference). The putative FIE
maize proteins share 57-68% identity with the Arabidopsis FIE protein. FIS2/3
genes do not demonstrate such a remarkable conservation. The duplication of the
maize Fie gene raises a question about the functional redundancy of the two
copies. The two ZmFie genes show a different pattern of expression in vegetative
and reproductive tissues, but they may have overlapping function in the
developing kernels.

In this example, the expression of two maize Fie genes in developing
kernels has been analyzed by several different methods, which lead to the
conclusion that the two ZmFie genes may have nonredundant functions. Based
on the expression pattern and a temporal type of imprinting, ZmFie2 is likely to be
a functional homologue of the Arabidopsis FIE gene and most likely is involved in
the repression of endosperm development before pollination. The expression of
ZmFie1 is triggered in endosperm after pollination, which implies no repressive
function in the embryo sac before pollination, but reveals a new endosperm-
specific FIE function in maize. Only the maternal ZmFIE1 allele is expressed
during kernel development, implying a strong regulation by imprinting. Based on
the genomic sequences of ZmFIE genes, different models for temporal and
permanent types of imprinting are proposed. Thus far the ZmFIE1 gene is found
only in maize, which is likely to be a consequence of its allotetraploid origin.

Experimental Procedures 

RNA Gel Blot Analysis.

To analyze ZmFIE expression in developing kernels, mRNA was isolated
from non-pollinated ovules at silking and from kernels at 3, 6, 9, 12, and 15 days
after pollination (DAP). Total RNA was extracted from 1 g of material using a hot
phenol extraction procedure and a selective precipitation with 4 M LiCl to remove
traces of DNA and small RNA species (Verwoerd et al. (1989) Nucleic Acids Res.
17:2362; Brugière et al. (1999) Plant Cell 11:1995-2012). For each time point,
kernels were collected from two ears harvested from two different plants
(replications) from either the B73 or Mo17 inbred lines. RNA was quantified using
a spectrophotometer at 260 nm. Poly(A) RNA was prepared from total RNA (400 µg)
using the Oligotex poly(A) purification kit (Qiagen). For gel blot experiments,
poly(A) RNA enriched samples were prepared as described by Becker et al.
(1993) Methods Enzymol. 218:568-587. Three µg of poly(A) RNA were loaded in
each lane. Electrophoretic separation was performed on 1.5% agarose gels
containing 5% (v/v) of a solution of 37% formaldehyde in Mops buffer (0.02 M
Mops, pH 7.0, 5 mM sodium acetate, and 1 mM EDTA). Gels were blotted onto a
nylon membrane (Roche Molecular Biochemicals) using TurboBlotter (Schleicher
& Schuell), with 20xSSC (1xSSC is 150 mM NaCl, 15 mM sodium citrate) as
transfer buffer. Blots were probed with 32P-labeled 300 bp fragments of ZmFIE1 or
ZmFIE2 cut from the 3' UTR of the appropriate EST clones. The fragment
sequences shared no homology, which avoided cross-hybridizations. Actin probe
was used as a loading control.

Distinguishing ZmFie mRNAs in Reciprocal Crosses.

Reciprocal crosses between B73 and Mo17 inbred lines were performed,
and F1 kernels were sampled at 2, 5, 10, and 15 days after pollination (DAP).
Total RNA was isolated and reverse transcription PCR (RT-PCR) reactions were
performed with the "Superscript kit." The PCR product differed between B73 and
Mo17 alleles in a 12 nt deletion. PCR product was separated on an HPLC WAVE
machine to distinguish between B73 and Mo17 alleles.

Primers to amplify ZmFIE2 were designed based on the MITE insertion in
the B73 ZmFIE2 allele. In B73 background, ZmFIE2 polyA transcripts are
terminated in the middle of this insertion. In Mo17 background, ZmFIE2 polyA
transcripts are terminated within genomic sequence with no homology to MITE
insertion. (See Figure 3A.) The forward primer positioned in exon eleven,
5'-GGTGAAGGCAAAATCTACGTGTGG-3' (SEQ ID NO: 2), is common for both
genotypes. The reverse primer 5'-CATTACGTTACAAATATGTGAACCAAACG-3'
(SEQ ID NO: 3) is specific for the B73 allele; reverse primer
5'-CAGAACAAACAGATGACAACGGTTCCCAAAG-3' (SEQ ID NO: 4) is specific for
the Mo17 allele. This primer combination allows for monitoring of B73 and Mo17
ZmFIE2 allele expression in developing kernels of the reciprocal crosses by
RT-PCR.
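
By way of non-limiting illustration, the allele-specific primer design described above can be sketched in Python. The primer sequences are those of SEQ ID NOs: 2-4; the dictionary keys, the helper names, and the toy transcript are hypothetical and are introduced here only for illustration. The sketch assumes exact primer matching and ignores annealing conditions.

```python
# Primer sequences as listed in the text (SEQ ID NOs: 2-4), written 5'->3'.
PRIMERS = {
    "F_common_exon11": "GGTGAAGGCAAAATCTACGTGTGG",
    "R_B73_specific": "CATTACGTTACAAATATGTGAACCAAACG",
    "R_Mo17_specific": "CAGAACAAACAGATGACAACGGTTCCCAAAG",
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def gc_count(seq):
    """Count G and C bases in a sequence."""
    seq = seq.upper()
    return seq.count("G") + seq.count("C")


def amplifies(template, forward, reverse):
    """Exact-match check: the forward primer occurs in the template and the
    reverse complement of the reverse primer occurs downstream of it."""
    template = template.upper()
    f_pos = template.find(forward.upper())
    if f_pos == -1:
        return False
    r_site = reverse_complement(reverse)
    return template.find(r_site, f_pos + len(forward)) != -1


# Hypothetical toy transcript (NOT real ZmFIE2 sequence): the common forward
# primer site, a short spacer, and the B73-specific reverse primer site.
fwd = PRIMERS["F_common_exon11"]
toy_b73 = fwd + "ACGT" * 5 + reverse_complement(PRIMERS["R_B73_specific"])

for name, seq in PRIMERS.items():
    print(name, len(seq), "nt, GC bases:", gc_count(seq))
print("B73 reverse primer on toy transcript:",
      amplifies(toy_b73, fwd, PRIMERS["R_B73_specific"]))
print("Mo17 reverse primer on toy transcript:",
      amplifies(toy_b73, fwd, PRIMERS["R_Mo17_specific"]))
```

In such a scheme a product is expected only when the reverse primer matches the allele from which the transcript was derived, which is what allows the two parental alleles to be scored separately in F1 kernels.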

In Situ Hybridization. 

To determine expression patterns of ZmFIE genes in maize, in situ
hybridization was performed using the protocol of Jackson (1991) in In situ
Hybridization in Plants, Molecular Plant Pathology: A Practical Approach, ed.
Bowles et al. (Oxford University Press, England), pp. 63-74. Sense and antisense
mRNA probes of 300 bp corresponding to the 3' UTR of ZmFIE genes were
labeled non-isotopically with digoxigenin-UTP by in vitro transcription with T7 and
T3 RNA polymerases (Roche Molecular Biochemicals). Probes were hybridized
with fixed sections of maize tissues from ovules at silking, and kernels at 5, 8, and
12 DAP. Following extensive washing to remove unbound probe, signal was
detected with anti-DIG antibodies conjugated with alkaline phosphatase to
mediate color reaction (Roche Molecular Biochemicals) that leads to a purple-blue
precipitate in the cells that contain mRNA. ZmFIE mRNAs were detected
specifically with the antisense probe; the sense probe did not hybridize, therefore
serving as a negative control.

Cloning and Sequencing of ZmFIE Genomic Fragments. 
BAC genomic libraries were screened with ZmFIE1 and ZmFIE2 ESTs.
Five BAC clones per each gene were identified and confirmed by Southern
hybridization. HindIII and EcoRI BAC fragments subcloned into vector BluescriptII
(KS) (Stratagene) were hybridized with ZmFIE probes, and positive clones were
sequenced.


DNA Sequence Analysis.

DNA assembly was performed using the Sequencher program (Gene Codes,
Ann Arbor, MI). BLAST search of GenBank was used for sequence annotation.
Sequence analysis was performed with GCG® programs (Accelrys, Inc., San
Diego, CA).

Nucleotide Sequence Accession Numbers.

The sequences have been deposited in the GenBank database under
Accession No. AY061964 (ZmFie1 genomic locus) and AY061965 (ZmFie2
genomic locus).

Example 1: Maize FIE (Fertilization Independent Endosperm) Homologues: 
Two Related Genes with Distinct Expression Patterns. 

Results 

Expression of ZmFIE Genes in Developing Kernels.

ZmFie genes have a different pattern of expression in vegetative and
reproductive tissues. Expression of ZmFIE1 was detected only in developing
kernels, not in vegetative tissues. Conversely, ZmFIE2 expression was found in all
tissues tested. If these genes participate in repression of embryo sac development
before fertilization in a manner similar to the Arabidopsis FIE homologue, they
should be expressed in the ovules before fertilization. To understand the function
of both genes, their expression in ovules and developing kernels was detected by
mRNA gel blot experiments, by gene expression analysis using massively parallel
signature sequencing (MPSS) (Brenner et al. (2000) Nat. Biotechnol. 18:630-634),
and by in situ hybridization.

For RNA gel blot experiments, mRNA was isolated from non-pollinated
ovules and from developing kernels at 3, 6, 9, 12, and 15 days after pollination
(DAP). ZmFIE1 mRNA is not detected in ovules and 3 DAP kernels. It appears
first in 6 DAP kernels, reaching a maximum of expression in 9 DAP kernels, and
gradually declines at later stages. The expression pattern of ZmFIE2 is very
different: mRNA is detected in ovules and all stages of developing kernels, but
declines after 6 DAP. RNA gel blot experiments demonstrate a low abundance of
ZmFIE2 mRNA, compared to ZmFIE1 mRNA, which shows significantly higher
expression.

To achieve a more sensitive assay of ZmFIE expression, these cDNA
sequences were searched with a BLAST algorithm against the gene expression
database generated by the MPSS method from different maize tissues. Massively
parallel signature sequencing (MPSS) generates 17-mer sequencing tags of
millions of cDNA molecules, which are in vitro cloned on microbeads (Brenner et
al. (2000) Nat. Biotechnol. 18:630-634). The technique provides an unprecedented
depth and sensitivity even for messages that are expressed at very low levels.
MPSS is based on the DpnII (GATC) restriction site availability in cDNA templates.
If the site is absent, the 17-mer tags are not generated. ZmFie2 does not have the
appropriate DpnII site and is not suitable for MPSS analysis. For this reason, only
ZmFIE1 tags were found. Distributions of the ZmFIE1 tags in MPSS experiments
are shown in Figure 1. No tags were detected in mRNA isolated from ovules.
Thus, if ZmFIE1 were transcribed in ovules, it would produce less than one mRNA
molecule per 10^6 total mRNA molecules. At 8 DAP, the number of tags is about
600 PPM (particles per million), gradually decreasing at later stages and reaching
20 PPM at 35 DAP. No tags are found in 40 DAP kernels. This trend is in
complete agreement with mRNA gel blot experiments and RT-PCR (data not
shown). The second important observation from MPSS experiments is the
expression of ZmFie1 in the developing endosperm. Embryo and endosperm were
dissected for MPSS experiments from kernels as early as 10 DAP. At this stage,
ZmFIE1 expression is approximately 20-30 times higher in endosperm than in
embryo. MPSS analysis strongly suggests that transcription of ZmFIE1 is
activated in developing kernels approximately 5-6 days after pollination,
predominantly in endosperm.
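
By way of illustration only, the dependence of MPSS on a DpnII (GATC) site and the expression of tag counts in parts per million can be sketched as follows. The sketch assumes that the 17-mer signature begins at, and includes, the 3'-most GATC site of the cDNA; this placement, the function names, and the toy sequence are assumptions made for illustration and are not taken from the text above.

```python
def mpss_signature(cdna, site="GATC", tag_len=17):
    """Return the tag_len-nt signature starting at the 3'-most `site`,
    or None if the cDNA has no usable site (assumed tag placement)."""
    cdna = cdna.upper()
    pos = cdna.rfind(site)
    while pos != -1:
        tag = cdna[pos:pos + tag_len]
        if len(tag) == tag_len:
            return tag
        # Site too close to the 3' end: fall back to the next site upstream.
        pos = cdna.rfind(site, 0, pos + len(site) - 1)
    return None


def tags_per_million(tag_count, total_tags):
    """Normalize a raw tag count to tags per million sequenced tags."""
    return tag_count * 1_000_000 / total_tags


# Hypothetical toy cDNA with two GATC sites; the signature is taken from the
# site nearest the 3' end.
toy_cdna = "AAGATC" + "C" * 20 + "GATC" + "ACGTACGTACGTA" + "TTT"
print(mpss_signature(toy_cdna))
print(mpss_signature("ACGTACGTACGT"))   # no GATC site, so no tag
print(tags_per_million(600, 1_000_000))
```

A transcript lacking the site yields no signature under this scheme, which is consistent with the statement above that ZmFie2 cannot be profiled by MPSS. Likewise, when no tags are observed among roughly one million sequenced tags, the transcript abundance can only be bounded at below about one molecule per million.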
Because this type of analysis is not available for ZmFIE2, in situ
hybridization was performed. Longitudinal sections of B73 ovules and kernels at
2, 5, 8 and 15 DAP were prepared and hybridized with antisense RNA probes, and
with sense RNA probes as a negative control. The sense probe revealed no
background signals, and images are not shown. ZmFIE2 antisense probes gave a
signal in the embryo sac of the mature ovules at silking. At 2 DAP, zygotes had a
significantly increased signal compared to ovules, indicating that ZmFIE2
transcription is activated de novo, and the signal intensity may not be explained by
the pre-existing maternal RNA. In kernels at 5 DAP, the most intense signal
appeared in the embryo-surrounding region and on the periphery of the
developing endosperm. At the later stage of 15 DAP, the signal persists in the
embryo and is not detectable in the endosperm. It shows also the clear pattern of
an axis polarity, being more intensive in the areas of leaves and root primordia.
In summary, the ZmFIE2 gene is expressed in the embryo sac before
pollination and in developing embryo after fertilization, as well as in vegetative
tissues. This pattern of expression is very similar to that observed for Arabidopsis
FIE, but very different from that observed for ZmFIE1.

Pattern of Maternal and Paternal ZmFie Allele Expression During Kernel
Development.

The Arabidopsis FIE gene demonstrates a parent-of-origin effect on seed
development, suggesting that only the maternal FIE allele is essential, whereas
the paternal FIE allele plays no role in seed development (Yadegari et al. (2000)
Plant Cell 12:2367-2382; Luo et al. (2000) Proc. Natl. Acad. Sci. USA
97(19):10637-10642). Current evidence supports the model that the FIE gene is
an imprinted gene, in which the maternal allele is expressed and the paternal
allele is silenced during seed development (Yadegari et al. (2000); Luo et al.
(2000) Proc. Natl. Acad. Sci. USA 97(19):10637-10642). To understand whether
the maize FIE homologues are regulated by imprinting in the same manner as the
Arabidopsis FIE gene, the paternal- and maternal-specific FIE mRNA levels were
measured in developing kernels.

To distinguish maternal and paternal ZmFIE mRNAs, an insertion/deletion
sequence polymorphism was identified in both ZmFIE1 and ZmFIE2 genes in
inbred lines Mo17 and B73. Reciprocal crosses were performed between B73 and
Mo17 lines, and kernels were collected at 2, 5, 10, 15, and 16 DAP. Ovules and
selfed kernels from both inbred lines were sampled at 11 DAP as controls. Total
RNA was extracted from the whole kernels.

Mo17 and B73 ZmFIE1 alleles differ by a 12 nt insertion/deletion in
the 3' UTR. The reverse and forward primers were designed around this indel to
produce the 300 bp RT-PCR product, which was separated on a D-HPLC column by
WAVE machine. As shown in Figure 2A and 2B, only maternal ZmFIE1 RNAs
were detected in reciprocal crosses in 15 DAP kernels. No detectable level of the
paternal RNA was found at early stages (data not shown). The same set of RNAs
was used with an anonymous gene as a control for bi-allelic expression (Figure
2C). The paternal allele of a control non-imprinted gene was detected in 5 DAP
kernels and all later stages, confirming that the paternal gene is expressed in
kernels. Thus, the ZmFIE1 paternal allele undergoes transcriptional silencing in
developing kernels, and this gene is regulated by imprinting. As noted above,
ZmFIE1 is expressed predominately in endosperm; this is in agreement with
previous reports that all known imprinted genes in plants are expressed in triploid
endosperm. Thus far, imprinting has not been demonstrated for genes expressed in
diploid tissues.

A different strategy was used for monitoring allelic expression of ZmFie2.
ZmFie2 genomic sequence from inbred B73 contains the 185 nt MITE insertion at
3' UTR, which is not present in the Mo17 allele (Figure 3A). The insertion is
flanked by 15-nt inverted repeats and creates the 5 nt direct target duplication
(Figure 3B). These features are typical for MITE elements, which are very
abundant components of the maize genome (Wessler (2001) Plant Physiol.
125(1):149-51). In B73, ZmFIE2 polyA transcripts are terminated in the middle of
the MITE insertion. In Mo17 background, ZmFIE2 polyA transcripts are
terminated within genomic sequence with no homology to MITE. The MITE
sequence was used to design allele-specific primers to discriminate between B73
and Mo17 ZmFIE2 mRNAs (Figure 3A).

The forward primer, F, designed for exon 11, is common for both
genotypes. The reverse primers, R, are genotype specific. The primer
combinations are highly allele-specific; no RT-PCR products are found in RNA
samples from ovules or selfed homozygous kernels. The primers allow monitoring
of the expression of maternal and paternal ZmFIE2 alleles in developing kernels.
Maternal allele expression was detected at all stages in both reciprocal crosses,
being more abundant in 2 DAP zygotes. These results are in agreement with the
in situ hybridization data, which demonstrated an increased ZmFIE2 expression in
2 DAP zygotes in the embryo-surrounding region. Paternal allele expression is
delayed up to 10 DAP, but at later stages, both maternal and paternal alleles are
expressed. Delayed expression, but not a complete silencing, of the paternal allele
is a feature of the Arabidopsis FIE gene. As mentioned previously, the ZmFIE1
gene undergoes permanent silencing of the paternal allele, demonstrating a
different type of imprinting.

Genomic Structure of ZmFIE Loci.

The transcriptional pattern of ZmFIE genes is very different with respect to
tissue specificity, efficiency, and imprinting. ZmFIE1 is expressed only in
developing kernels at a relatively high level, and with a permanent silencing of the
paternal allele. Conversely, ZmFIE2 is expressed in vegetative and reproductive
tissues, showing a very low level of expression in developing kernels, with delayed
paternal allele expression. To reveal the molecular mechanisms underlying the
different patterns of ZmFIE expression, the genomic loci of both genes have been
sequenced. Genomic BAC libraries were screened with ZmFIE1 and ZmFIE2
cDNAs. Five BACs were identified for each gene covering the overlapping regions
(about 250 kb). Approximately 12-kb segments carrying ZmFIE genes have been
subcloned and sequenced (Figure 4A and 4B). The positions of nucleotides are
relative to the translation start site, ATG (+1). (The transcription start site is used
more often as a reference point, but it is not identified precisely for FIE
transcripts.)

The coding regions of both genes downstream of the translation start site,
ATG, possess 13 exons, which are identical in size between the two genes,
except for the first and last exons where initiation and termination of transcription
occur. The number and sizes of the protein coding exons are also identical to the
Arabidopsis FIE gene (GenBank Accession No. AF129516). The intron
sequences vary in length and do not share a significant homology between
ZmFIE1 and ZmFIE2 and Arabidopsis. However, ZmFIE1 demonstrates a unique
feature among the FIE family, the presence of a 290 bp intron, located in the 5'
UTR, just 6 nucleotides upstream from the ATG codon (-6 and 390). The first exon
and intron are very often required for high level expression of the reporter, which
may be a result of the increased level or stability of the mature cytoplasmic mRNA
constructs (Kim and Guiltinan (1999) Plant Physiol. 121(1):225-236; Clancy et al.
(1994)). It is very likely that the 5' UTR intron of ZmFIE1 plays a regulatory role or
determines the tissue specificity of FIE1 protein expression.

The 5' upstream regions of the two genes are very different. The size of
the putative promoter region of the ZmFIE1 gene is estimated to be about 900 nt,
between the RNA start of the longest EST (Accession No. AY061964) and the
retrotransposon RIRE LTR (Figure 4A; Figure 5). Dot plot analysis (data not
shown) does not reveal any repeats as far as 5 kb upstream of the RIRE
retrotransposon. Repeats are commonly speculated to be involved in imprinting
(Alleman and Doctor (2000) Plant Mol. Biol. 43:147-161). However, this analysis
indicates that this is very unlikely to be the case for the imprinting mechanism of
the ZmFIE1 gene.

The 5' upstream region of the ZmFIE2 gene is about 6 kb long as estimated
between the transcription start site of the longest ZmFIE2 cDNA (Accession No.
AY061965) and the retrotransposon MILT LTR. The extensive BLAST search of
this sequence against the public and proprietary databases did not show any
homology to known sequences, suggesting that the 6 kb 5' upstream region of the
ZmFIE2 gene is its unique integral part. Dot plot analysis (not shown) revealed the
complex pattern of repeats positioned along the 6 kb upstream region (Figure 4B;
Figure 6). The sequence between -1161 and -3479 consists of three types of
repeats, named A, B, and C. Repeats form a 2.6 kb symmetrical structure having
the following order: A1-B1-C1-B2-A2. The B3 and C2 types are repeated again
(-5328 to -6077), forming one more cluster. Repeats A1-A2 are 550 nt long with 95%
homology; B1-B2-B3 are 350 nt long with 94% homology, and C1-C2 are 420 nt
long with 93% homology (Figure 6). Repeats do not share any homology or
features of the transposable elements. They form a unique configuration and may
be considered as a potential cis-regulating element of the ZmFIE2 gene. The
basal promoter of the ZmFIE2 gene is estimated to be about 768 bp if framed
between -393 and -1161, which marks the transcription start of the longest EST
and the beginning of the B2 repeat.
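
For illustration, a self-comparison dot plot of the kind referred to above can be approximated by indexing exact k-mer matches. This is a minimal sketch and not the program used to produce the reported analysis: the word size, the function names, and the toy sequence are hypothetical, and exact matching would miss repeats of 93-95% identity unless the word size is kept short.

```python
from collections import defaultdict


def kmer_index(seq, k):
    """Map each k-mer to the list of start positions where it occurs."""
    index = defaultdict(list)
    for i in range(len(seq) - k + 1):
        index[seq[i:i + k]].append(i)
    return index


def self_dotplot(seq, k=12):
    """Return sorted (i, j) pairs, i < j, where the k-mers at i and j match.
    Each pair is one off-diagonal dot of a self-comparison dot plot."""
    seq = seq.upper()
    dots = []
    for positions in kmer_index(seq, k).values():
        for a in range(len(positions)):
            for b in range(a + 1, len(positions)):
                dots.append((positions[a], positions[b]))
    return sorted(dots)


def diagonals(dots):
    """Count dots per offset (j - i). Many dots on a single offset indicate
    a direct repeat whose copies are that many nucleotides apart."""
    counts = defaultdict(int)
    for i, j in dots:
        counts[j - i] += 1
    return dict(counts)


# Hypothetical toy sequence: one 28-nt unit, a 10-nt spacer, the unit again.
unit = "ACGTTGCAAGGCTTAGCCATGGATCCTA"
toy = unit + "TCTCTCTCTC" + unit
print(diagonals(self_dotplot(toy, k=8)))
```

In the toy sequence the two copies of the unit lie 38 nt apart, so the matching words accumulate on the diagonal with offset 38. Inverted repeats could be detected in the same way by comparing the sequence against its reverse complement.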

The CG Composition of the ZmFIE Genes in Relation to Imprinting.

As discussed above, ZmFIE expression is regulated by imprinting but in a
different temporal fashion. The paternally derived ZmFIE1 allele is permanently
silenced during kernel development. Expression of ZmFIE2 undergoes less
stringent temporal imprinting, because the paternal allele is reactivated later in
kernel development (after 10 DAP). It has been widely speculated that imprinting
is mediated by DNA methylation. CpG island methylation may be a key molecular
mechanism of imprinting (Wutz et al. (1997) Nature 389(6652):745-749;
Thorvaldsen et al. (1998) Genes Dev. 12(23):3693-3643; Reik and Dean (2001)
Electrophoresis 22(14):2838-2843). Recently a two-island rule was proposed to
define genes regulated by imprinting (Onyango et al. (2000) Genome Res.
10(11):1697-1710). In this reference, comparative analysis of human and mouse
imprinted genes revealed that two or more CpG islands are associated with
imprinted genes, while at most one CpG island is associated with nonimprinted
genes. The CpG islands were defined in this reference as sequences of about 200
bp with a GC content >50% and an observed-to-expected CpG content >60%.
These criteria were applied for searching for CpG islands along the FIE loci.
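
As a non-limiting sketch, the island criteria stated above (a window of about 200 bp, GC content above 50%, and observed-to-expected CpG ratio above 0.6) can be applied with a sliding window. The observed-to-expected ratio is computed here as (number of CpG x window length) / (number of C x number of G), which is a commonly used formula and is assumed rather than taken from the text. The one-nucleotide step, the merging of overlapping windows, and all function names are likewise illustrative assumptions.

```python
def window_stats(window):
    """Return (GC fraction, observed/expected CpG ratio) for one window."""
    n = len(window)
    c = window.count("C")
    g = window.count("G")
    cpg = window.count("CG")
    gc_fraction = (c + g) / n
    # Assumed formula: obs/exp = (#CpG * N) / (#C * #G); 0 if no C or no G.
    obs_exp = (cpg * n) / (c * g) if c and g else 0.0
    return gc_fraction, obs_exp


def find_cpg_islands(seq, window=200, min_gc=0.5, min_obs_exp=0.6):
    """Slide a window along seq and merge overlapping qualifying windows.
    Returns a list of (start, end) tuples, 0-based, end exclusive."""
    seq = seq.upper()
    islands = []
    for start in range(len(seq) - window + 1):
        gc, obs_exp = window_stats(seq[start:start + window])
        if gc > min_gc and obs_exp > min_obs_exp:
            end = start + window
            if islands and start <= islands[-1][1]:
                islands[-1] = (islands[-1][0], end)   # extend current island
            else:
                islands.append((start, end))
    return islands


def two_island_rule(islands):
    """Apply the two-island rule to a list of islands found at a locus."""
    return "candidate imprinted" if len(islands) >= 2 else "at most one island"


# Hypothetical toy locus: a CG-rich block flanked by AT-rich sequence.
toy_locus = "AT" * 150 + "CG" * 150 + "AT" * 150
islands = find_cpg_islands(toy_locus)
print(islands, two_island_rule(islands))
```

Coordinates returned by such a scan are relative to the start of the input sequence and would need to be converted to positions relative to the ATG (+1) to be compared with the positions reported for the FIE loci.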

This analysis revealed three CpG islands within the ZmFIE1 locus. One
island is located between -2968 and -3219 (Figure 7), which corresponds to the
retrotransposon segment and very likely is irrelevant to regulation of ZmFIE1. The
other two islands are located within the ZmFIE1 coding region, which agrees with
the two-island rule. The first of these two CpG islands is 252 bp and is positioned
between +87 and +274, just downstream of the ATG codon. The second of these
two CpG islands is 572 bp long and is located at the 3' end of the gene, between
+4315 and +4886, covering the last two introns and exons.

Only one CpG island is present in the ZmFie2 locus, at position -231 to
+88, around the ATG codon (Figure 7). This agrees with the definition of non-
imprinted genes, which are associated with at most one CpG island (Onyango et
al. (2000) Genome Res. 10(11):1697-1710).

These data suggest that the imprinting mechanism of ZmFIE1 is very likely
associated with DNA methylation of two CpG islands. The delayed expression of
the paternal ZmFIE2 allele, which could be considered as a temporal imprinting, is
not associated with DNA methylation. The complex repetitive structure of the 5'
upstream region may be responsible for this type of imprinting.


Phylogenetic Analysis of Plant FIE Proteins. 

ZmFIE1 and ZmFIE2 genes are mapped to chromosome 4 (bin 4.05) and
chromosome 10 (bin 10.3), respectively. These regions are duplicated in the maize
genome (Helentjaris (1995) Maize Newsletter 69:67-81; Gaut and Doebley (1997)
Proc. Natl. Acad. Sci. USA 94(13):6809-6814). It is very likely that the two ZmFIE
genes are due to the allotetraploid origin of the maize genome (Gaut and Doebley
(1997), supra). Presence of two FIE genes in the maize genome raises the
question whether two FIE genes exist in other species as well. A search by
TBLASTX of the public EST database reveals accession numbers for 11 species,
and putative FIE proteins were reconstructed. The FIE protein belongs to the
Polycomb Group (PcG) proteins, which include the Drosophila extra sex combs
(ESC), and mammalian embryonic ectoderm development proteins (EED). To
make the phylogenetic analysis more robust, the PcG proteins from five insect and
two mammalian species were included. A phylogenetic tree was constructed
using the PAUP program (Figure 8). The phylogenetic tree forms four major
clades corresponding to mammals, insects, monocots, and dicots. The
Arabidopsis FIE protein is positioned apart, reflecting the absence of the protein
from related species.

So far all analyzed plant species show the presence of the one putative FIE
protein. The phylogenetic tree demonstrates that the sorghum FIE and ZmFIE2
proteins are more closely related to each other than to the ZmFie1 protein. Thus, a
ZmFie1 analog has not yet been found. This does not prove the absence of
homologs to ZmFIE1 in other species, but the probability is very high that FIE1 is
unique to the maize genome.

Discussion

ZmFIE Genes Are Differentially Expressed. 

In understanding the role of ZmFIE genes, it is crucial to know in which
tissues and cells these loci are active and whether two genes are active in the
tissues at the same developmental times. The Arabidopsis single FIE gene
(AtFIE) is expressed in many tissues, both reproductive and vegetative, indicating
that this FIE protein may have multiple functions during plant development. AtFIE
is expressed in the embryo sac before fertilization, and its expression continues in
the embryo and endosperm after fertilization (Ohad et al. (1999); Luo et al. (2000)
Proc. Natl. Acad. Sci. USA 97(19):10637-10642). Loss-of-function alleles of AtFIE
demonstrate pleiotropic phenotypes, including initiation of endosperm
development without fertilization, embryo abortion at early stages, premature
flowering by seedling shoots, and flower-like structures along the roots and
hypocotyls (Ohad et al. (1999) Plant Cell 11:407-416; Kinoshita et al. (2001) Proc.
Natl. Acad. Sci. USA 98(24):14156-14161). These results suggest FIE protein
encoded by a single-copy gene in the Arabidopsis genome may form distinct
complexes in different plant tissues and participate in repression of several
developmental programs.

As has been shown by RT-PCR, the ZmFIE1 gene is active only in kernels
after pollination, but ZmFIE2 has very broad expression in virtually all tissues,
much like the Arabidopsis FIE. Because both ZmFIE genes are expressed in
developing kernels, their expression in this organ has been studied by different
methods to understand whether these genes have a functional redundancy. The
RNA gel blot experiments revealed significant differences between the
transcriptional activity of these two genes. ZmFIE1 RNA revealed the inducible
pattern of expression with a maximum activity around 9 DAP. ZmFIE2 RNA is
detected at a steady level across the various developmental stages as very low-
abundance transcripts. Moreover, the FIE genes are active in different tissues of
the developing kernels. ZmFIE1 is active in the endosperm, as shown by the
MPSS RNA profiling experiments (Figure 1). The small number of tag sequences
detected in the embryo tissues may be explained by contamination of the embryos
with endosperm cells during tissue dissection, particularly in view of the sensitivity
of detection in MPSS experiments (1 molecule per million). ZmFIE2 cDNA is not
suitable for MPSS analysis, as it lacks the restriction site for enzyme DpnII, which
is used to generate tags. But in situ hybridization experiments have shown that
the ZmFIE2 transcripts occur in the embryo, not in the endosperm, suggesting that
these two ZmFIE genes are active in different tissues of the developing kernels.
Thus the expression patterns argue in favor of the nonredundant function of these
two FIE proteins in developing kernels.

Of importance is the pattern of ZmFIE expression in the female
gametophyte, i.e., the embryo sac before fertilization. The Arabidopsis FIE mRNA
is found before fertilization in the embryo sac (Luo et al. (2000) Proc. Natl. Acad.
Sci. USA 97(19):10637-10642), confirming its function as a repressor of
endosperm development. Expression of ZmFIE is different in the female
gametophyte as well. ZmFIE1 mRNA is not detected in ovules by RNA blot
analysis or by MPSS profiling (Figure 1). The high sensitivity of MPSS provides
strong evidence of no basal expression, or low expression, of ZmFIE1 in ovules
before pollination. Conversely, the in situ hybridization data show a detectable
amount of ZmFIE2 RNA in the embryo sac. Out of these two maize FIE proteins,
only FIE2 is a candidate for a repressor of endosperm development before
fertilization, the function performed by the Arabidopsis FIE protein. Loss-of-
function mutant analysis will confirm this function.

ZmFIE Genes Are Regulated by Imprinting.

The prominent feature of the Arabidopsis FIS genes is their parent-of-origin
effect in developing seeds (Grossniklaus et al. (1998) Science 280(5362):446-450;
Ohad et al. (1996) Proc. Natl. Acad. Sci. USA 93(11):5319-5324; Luo et al. (2000)
Proc. Natl. Acad. Sci. USA 97(19):10637-10642). The wild-type paternal alleles do
not rescue the maternally derived mutant alleles (Grossniklaus et al. (1998)
Science 280(5362):446-450; Ohad et al. (1996) Proc. Natl. Acad. Sci. USA
93(11):5319-5324); and paternally derived allele expression is delayed (FIE and
MEA) or nonexistent (FIS2) (Luo et al. (2000) Proc. Natl. Acad. Sci. USA
97(19):10637-10642). FIS genes are regulated by imprinting, emphasizing the
importance of maternal control of early seed development.

To investigate the possibility that ZmFIE genes are also imprinted, several
experiments were conducted to monitor the paternal and maternal FIE RNAs in
developing kernels. Both genes show silencing of paternal allele expression with a
distinct temporal pattern.

The ZmFIE2 paternal allele shows no detectable activity until 10 DAP. This
pattern of silencing is very similar to AtFIE, in which imprinting is in force until 3
DAP and later breaks down (Luo et al. (2000) Proc. Natl. Acad. Sci. USA
97(19):10637-10642).

The ZmFIE1 paternal allele shows no expression at any developmental
stages (Figure 2), resembling in this aspect the Arabidopsis gene FIS2 (Luo et al.
(2000) Proc. Natl. Acad. Sci. USA 97(19):10637-10642). ZmFIE and AtFIS2 are
different types of proteins, but they are encoded by genes with very specific
patterns of expression in the endosperm. FIS2::GUS activity was observed only in
endosperm of the developing seed (Luo et al. (2000) Proc. Natl. Acad. Sci. USA
97(19):10637-10642). ZmFIE1 expression is also limited to the endosperm. This
suggests that genes that are expressed only in endosperm, similar to AtFIS2 and
ZmFIE1, undergo more stringent, permanent imprinting. Genes that are expressed
in both embryo and endosperm, like AtFIE and MEA, are regulated by less
stringent, temporal imprinting, which causes a delay in expression of paternal
alleles, and subsequent breakdown of imprinting later in development. The
ZmFIE2 gene belongs to this group, which is regulated by a temporal type of
imprinting.
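
The distinction drawn above between permanent and temporal imprinting can be expressed, purely for illustration, as a rule applied to presence/absence calls for the paternal allele across sampled stages. The function name, the category labels, and the example time courses below are hypothetical; in particular, the example dictionaries are illustrative encodings of the patterns described in the text and are not measured data.

```python
def classify_paternal_pattern(paternal_detected):
    """Classify a time course of paternal-allele detection.

    paternal_detected maps days after pollination (DAP) to True/False.
    Returns "permanent" if the paternal allele is never detected,
    "temporal" if it is silent early and detected at every later stage,
    "biallelic" if it is detected at every stage, else "unclassified".
    """
    stages = sorted(paternal_detected)
    calls = [paternal_detected[dap] for dap in stages]
    if not any(calls):
        return "permanent"
    if all(calls):
        return "biallelic"
    first_on = calls.index(True)
    if all(calls[first_on:]):
        return "temporal"
    return "unclassified"


# Illustrative encodings only (assumed calls, not data from the experiments):
never_detected = {2: False, 5: False, 10: False, 15: False}   # ZmFIE1-like
delayed_onset = {2: False, 5: False, 10: False, 15: True}     # ZmFIE2-like
always_detected = {2: True, 5: True, 10: True, 15: True}

for label, course in [("never detected", never_detected),
                      ("delayed onset", delayed_onset),
                      ("always detected", always_detected)]:
    print(label, "->", classify_paternal_pattern(course))
```

A rule of this kind is only as good as the sampling: a gene sampled before its paternal allele is reactivated would be called permanent, so the classification is best read as relative to the stages examined.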

Mammalian Models of Imprinting May Be Applicable to Plants.

ZmFIE genes have a differential parent-of-origin activity and are regulated
by permanent and temporal types of imprinting. The presence of repeated
sequences is a common feature of epigenetically silenced and imprinted genes
(Alleman and Doctor (2000) Plant Mol. Biol. 43:147-161). Fragments of 12 kb of
the Mo17 genomic loci of ZmFIE have been sequenced (Figure 4). A complex
repetitive structure is found 5' upstream of the ZmFIE2 coding region. Repeats
occupy the 2.6 kb fragment adjacent to a putative promoter and a 1 kb fragment
further upstream. The entire 6 kb upstream fragment does not share any
homology to transposable elements, which are abundant sequences of the
intergenic regions in the maize genome. It appears that the structural repetitive
complex upstream of the ZmFIE2 gene is an integral part of this gene and may be
a cis-element regulating ZmFIE2 activity. A critical aspect of the ZmFIE2
expression is the delayed activity of the paternal allele in the developing kernels,
referenced herein as temporal imprinting. The upstream-positioned repeats may
be involved in setting imprinting marks on the ZmFIE2 gene during
gametogenesis. It is possible that specific proteins that function as activators or
repressors of gene expression bind with these repeats. These complexes might
be temporally associated with the upstream sequence but degraded during kernel
development.

The genomic sequence of the ZmFIE1 gene does not possess such
obvious structures as repeats. Moreover, the promoter region of ZmFIE1 is
relatively short, approximately 780 nt between the putative RNA start and the LTR
of a retrotransposon RIRE (Figure 4). The special feature of the ZmFie1 gene is
the 290 bp intron positioned at the 5' untranslated region. The first exon and intron
are often required for high level expression of the reporter, which may be a result of
the increased level or stability of the mature cytoplasmic mRNA constructs (Kim
and Guiltinan (1999) Plant Physiol. 121(1):225-236; Clancy et al. (1994)). It is
very likely that the 5' UTR intron of ZmFIE1 plays a regulatory role or determines
the tissue specificity of FIE1 protein expression. There are no indications in the




literature that introns are involved in genomic imprinting. It has been proposed
that CpG islands might be common imprinting elements in mammalian genes
regulated by imprinting (Wutz et al. (1997) Nature 389(6652):745-749).
Methylation of these islands during gametogenesis creates the imprinting signals
that maintain expression of the maternal or paternal alleles. The comparative
analysis of mouse and human imprinted domains suggests a two-island rule for
imprinted genes (Onyango et al. (2000) Genome Res. 10(11):1697-1710).
Imprinted genes show two or more conserved CpG islands upstream or within the
gene, while non-imprinted genes are associated with at most one CpG island.
CpG islands are normally unmethylated and associated with actively transcribed
genes, but allele-specific methylation of CpG islands appears to mark imprinted
genes in mammals (Wutz et al. (1997) Nature 389(6652):745-749).

The distribution of CpG islands within the ZmFie1 and ZmFie2 genomic
sequences was searched using a definition of CpG islands as sequences of >200
bp with a GC content >0.5 and an observed-to-expected CpG dinucleotide content
>0.6. This analysis revealed two CpG islands in ZmFIE1 and one CpG island in ZmFIE2
(Figure 7). The results concur with the two-island rule. The ZmFIE1 gene, in which
the paternal allele is silenced during all stages of kernel development, shows two
CpG islands. The ZmFIE2 gene, which demonstrates a more relaxed type of
imprinting, shows only one CpG island, implying a different mechanism of delayed
expression of the paternal allele, which is not associated with DNA methylation.
The data presented herein suggest that CpG islands may be the imprint marks in
plants as well.
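
By way of illustration only, the CpG island criteria stated above (GC content >0.5 and observed-to-expected CpG ratio >0.6 over a stretch of at least 200 bp) can be sketched in Python. The sliding-window scan, the merging of overlapping windows, and the function names `gc_content`, `cpg_obs_exp` and `find_cpg_islands` are assumptions made for this sketch; the specification does not state which program was used for the search.

```python
def gc_content(seq):
    """Fraction of G and C bases in seq (0.0 for an empty string)."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq) if seq else 0.0


def cpg_obs_exp(seq):
    """Observed-to-expected CpG ratio: (#CpG * length) / (#C * #G)."""
    seq = seq.upper()
    c, g = seq.count("C"), seq.count("G")
    if c == 0 or g == 0:
        return 0.0
    return seq.count("CG") * len(seq) / (c * g)


def find_cpg_islands(seq, window=200, min_gc=0.5, min_oe=0.6):
    """Return (start, end) intervals, 0-based and end-exclusive.

    Every window of `window` bp that passes both thresholds is kept,
    and overlapping or touching windows are merged into one island.
    The window size of 200 bp is an assumption of this sketch.
    """
    seq = seq.upper()
    islands = []
    for i in range(len(seq) - window + 1):
        w = seq[i:i + window]
        if gc_content(w) > min_gc and cpg_obs_exp(w) > min_oe:
            if islands and i <= islands[-1][1]:
                # Extend the previous island to cover this window.
                islands[-1] = (islands[-1][0], i + window)
            else:
                islands.append((i, i + window))
    return islands
```

Applied to a genomic sequence, the number of intervals returned is the island count that the two-island rule compares between genes.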

This assumption generates several predictions that may be experimentally
tested. Transgenic constructs with a reporter gene placed between CpG islands
should mimic the parent-of-origin pattern of expression of the ZmFIE1 gene. The
pattern of DNA methylation across the ZmFIE1 gene can be tested in DNA
isolated from the male and female gametophytic tissues (pollen and ovules) and
endosperm. This would provide evidence for differential methylation of the islands
during gametogenesis and its maintenance during endosperm development.
Further, imprinted antisense transcripts are observed in all major imprinting
models in mammals (Fu et al. (2000) Proc. Natl. Acad. Sci. USA 99(2):1082-
1087); these were proposed originally in the sense/antisense competition model
for preferential allelic expression of the mouse Igf2r gene (Wutz et al. (1997)
Nature 389(6652):745-749).

The two-island rule can be used to predict imprinted genes in plants. In this
manner, a search of 2,000 full-length transcripts of annotated genes reveals that
10% of them fall within the category of two or more CpG islands. Relatively few
genes are described in plants as being regulated by imprinting, but this approach
provides a potentially useful predictive tool for identification of imprinted genes.
Support for the relevance of this approach comes from the finding that the α-tubulin
cDNA (tuba4) shows two CpG islands. Imprinting of the maize α-tubulin
genes (families tubaS and tuba4) has been documented (Lund et al. (1995) Mol.
Gen. Genet. 246(6):716-722). Moreover, expression of the sense and antisense
transcripts of the α-tubulin genes was demonstrated earlier (Dolfini et al. (1993)
Mol. Gen. Genet. 241(1-2):161-169). Having demonstrated the applicability of the
two CpG island rule for imprinting in the maize FIE genes, it seems probable that
this rule operates generally in plants, which suggests that the general mechanism of
imprinting may be conserved in evolution across the kingdoms.
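
As a minimal sketch of how the two-island rule might be applied as a screen, the following Python function takes a table of CpG island counts per gene and returns the genes meeting the threshold. The function name `predict_imprinted` and the dictionary layout are hypothetical; the island counts in the usage example are those reported herein for ZmFIE1, ZmFIE2 and tuba4.

```python
def predict_imprinted(island_counts, min_islands=2):
    """Return the names of genes with at least `min_islands` CpG islands.

    island_counts maps a gene name to the number of CpG islands found
    in its sequence. Names are returned in sorted order.
    """
    return sorted(
        gene for gene, n in island_counts.items() if n >= min_islands
    )


def candidate_fraction(island_counts, min_islands=2):
    """Fraction of screened genes that satisfy the two-island rule."""
    if not island_counts:
        return 0.0
    hits = predict_imprinted(island_counts, min_islands)
    return len(hits) / len(island_counts)


# Island counts as reported in this specification.
counts = {"ZmFIE1": 2, "ZmFIE2": 1, "tuba4": 2}
candidates = predict_imprinted(counts)
```

Genes returned by such a screen are only candidates; imprinting would still need to be confirmed experimentally.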

Two FIE Genes Reflect Maize Genome Evolution.

The ZmFIE genes are located in regions of chromosome 4 and
chromosome 10 that are very likely duplicated in the maize genome (Helentjaris
(1995) Maize Newsletter 69:67-81; Gaut and Doebley (1997) Proc. Natl. Acad.
Sci. USA 94(13):6809-6814). The phylogenetic analysis of the known plant FIE
proteins shows that the sorghum FIE and maize FIE2 proteins are more closely related
to each other than to the maize FIE1 protein (Figure 8). This observation concurs
with the hypothesis that the maize genome is a product of a segmental
allotetraploid event (Gaut and Doebley (1997) Proc. Natl. Acad. Sci. USA
94(13):6809-6814). These authors provided evidence that "at least some elements
of the sorghum genome share a more recent ancestor with one of the two maize
subgenomes than the two maize subgenomes share to each other" (Gaut and
Doebley (1997) Proc. Natl. Acad. Sci. USA 94(13):6809-6814). One can speculate
that a segmental duplication of chromosome 10 around a centromeric region (bin
10.03) has its origin in the sorghum-related progenitor. The orthologous region
on chromosome 4 around the centromeric region (bin 4.05) carrying the ZmFIE1
gene might originate from the second ancient genome, which was more diverged
from sorghum.

Despite the similarity between the ZmFIE1 and ZmFIE2 genes, they are
differently regulated. The ZmFIE2 gene has a broad expression pattern, whereas
ZmFIE1 expression appears to be restricted to developing kernels. These genes
are regulated by different types of imprinting. The data herein strongly support
nonredundant functions for these genes. The ZmFIE2 gene is very likely to be a
functional homologue of the Arabidopsis FIE gene, with multiple functions during
maize development, such as preventing endosperm development before
fertilization, and may be involved in embryo growth and control of
flowering. The second maize gene, ZmFIE1, has evolved for a kernel-specific
function, most likely in endosperm development. Experiments with null mutant
analysis will further elucidate the function of these genes in maize.

Example 2: Imprinting of the Maize Endosperm-Specific Gene FIE1 Is
Mediated by Demethylation of the Maternal Complements

Significant progress has been made in revealing imprinting mechanisms in
mammals, but no such progress has been made in plants. The underlying
mechanism of mammalian imprinting is differential DNA methylation of maternal
versus paternal alleles, a process that takes place during gametogenesis
(Constancia M, et al., Genome Res. 1998, 8: 881-900). DNA methylation means
the occurrence of 5-methylcytosine instead of cytosine in the context of a CpG
sequence. The major function of cytosine methylation is transcriptional repression.

Most of the CpG sites in higher eukaryotes are methylated, with the
exception of CpG islands, which are stretches of DNA enriched in CG
dinucleotides (Ponger et al., 2001, Genome Res 11:1854-1860). Imprinted
mammalian genes show differential DNA methylation in CpG islands (Reik et
al., 2001, Nat Rev Genet 2, 21-32). Onyango et al. (Genome Res, 2000, 10:1697-
1710) reported that mammalian imprinted genes show two or more CpG
islands within gene sequences, an observation referred to as the two-island rule.
As shown herein, the maize FIE1 gene is imprinted and contains two CpG islands
in its genomic sequence. This suggests some similarity between imprinting
mechanisms in plants and mammals. The role of cytosine methylation in imprinting
of the ZmFIE1 gene was further investigated, as follows.

Results

DNA methylation assay of ZmFIE genes in leaves, embryos and endosperms.
To investigate whether cytosine methylation occurs within ZmFIE genes and
correlates with imprinting, a quick and simple method was developed; it comprises
DNA digestion with methylation-sensitive restriction enzymes, followed by PCR
amplification across the restriction sites. PCR amplification of digested DNA
occurs only if the cytosines were methylated and thus protected the DNA from
digestion.

The commonly used enzymes HpaII and MspI were chosen for this analysis,
but any other methylation-sensitive enzymes or a mixture of several enzymes could
be used. Both enzymes recognize CCGG sites, but show different sensitivities to
cytosine methylation (New England Biolabs catalog). HpaII does not cut DNA if
either cytosine is methylated. MspI cuts DNA with the internal cytosine methylated,
but does not cut DNA when the external cytosine is methylated.
PCR primers positioned across the restriction CCGG sites will amplify the
HpaII/MspI digested DNA if the CCGG sites are methylated. The PCR reaction on
unmethylated HpaII/MspI digested DNA will fail.
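
The enzyme sensitivities described above can be summarized, purely as an illustrative sketch, in a small Python model that predicts whether a PCR product is expected after digestion. The class `CCGGSite` and the functions `is_cut` and `pcr_amplifies` are hypothetical names introduced for this sketch; the model assumes complete digestion and that a single cut anywhere between the primers prevents amplification.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CCGGSite:
    """Methylation state of the two cytosines in one CCGG site."""
    external_methylated: bool  # first (5') cytosine of CCGG
    internal_methylated: bool  # second cytosine, the C of the CpG


def is_cut(enzyme, site):
    """Apply the sensitivities stated in the text to one site."""
    if enzyme == "HpaII":
        # HpaII does not cut if either cytosine is methylated.
        return not (site.external_methylated or site.internal_methylated)
    if enzyme == "MspI":
        # MspI tolerates internal methylation but is blocked by
        # methylation of the external cytosine.
        return not site.external_methylated
    raise ValueError(f"unknown enzyme: {enzyme}")


def pcr_amplifies(enzyme, sites):
    """A product is expected only if no site in the amplicon is cut."""
    return not any(is_cut(enzyme, s) for s in sites)


unmethylated = CCGGSite(False, False)
internal_only = CCGGSite(False, True)
both = CCGGSite(True, True)
```

Under this model an amplicon whose sites are all unmethylated gives no product with either enzyme, which is the readout used for the ZmFIE2 gene below.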

The restriction maps of the ZmFie1 and ZmFie2 genomic sequences (Figure 9)
show a distinct distribution of HpaII/MspI sites (CCGG) across the genes,
scattered along ZmFIE1 and grouped in a cluster in ZmFIE2.

As shown previously, the ZmFIE1 gene has two GC-rich segments defined
as CpG islands. The first island is located within exon 1. The second island covers
exons 11-12 and the 3' UTR. The islands have two and three HpaII sites, respectively.
There are also HpaII sites in exon 7 and exon 10. Four pairs of primers were
designed around clusters of HpaII sites to monitor cytosine methylation in CCGG
sites (Figure 10).

The ZmFIE2 gene has one CpG island within exon 1. Eight HpaII sites are
grouped there. No HpaII sites are present in any other segments of the ZmFIE2
gene. One pair of primers was designed for the ZmFIE2 gene (Figure 10).
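
For illustration, locating the HpaII/MspI recognition sites in a genomic sequence and listing those that fall within a primer-defined amplicon could be sketched as follows. The function names `find_ccgg_sites` and `sites_in_amplicon` and the coordinate convention (0-based, end-exclusive) are assumptions of this sketch, and the short sequence in the usage example is invented rather than taken from the ZmFIE genes.

```python
def find_ccgg_sites(seq):
    """Return the 0-based start positions of every CCGG in seq."""
    seq = seq.upper()
    sites = []
    pos = seq.find("CCGG")
    while pos != -1:
        sites.append(pos)
        pos = seq.find("CCGG", pos + 1)
    return sites


def sites_in_amplicon(sites, amplicon_start, amplicon_end):
    """Keep sites lying entirely inside [amplicon_start, amplicon_end)."""
    return [p for p in sites if amplicon_start <= p and p + 4 <= amplicon_end]


# Invented example sequence with two CCGG sites.
example = "AACCGGTTccggAA"
example_sites = find_ccgg_sites(example)
```

A primer pair is informative for the assay only if at least one site is returned for its amplicon.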

DNA samples isolated from embryos and endosperms of 14 DAP kernels of
reciprocal crosses between public inbred lines B73 and Mo17 were digested with
HpaII and MspI enzymes separately. DNA extracted from leaves of the B73 inbred
was used as a control. PCR amplification of equal amounts of undigested and
digested DNA was performed and PCR products were visualized on agarose gels.

For the ZmFIE2 gene, none of the digested DNAs supported PCR
amplification, indicating that CCGG sites within ZmFIE2 are unmethylated in
the tissues tested (leaves, embryos, endosperms). These results are in good
agreement with the expression pattern of the ZmFIE2 gene. As shown previously,
this gene is expressed in all tissues throughout development. The unmethylated
status of the gene is consistent with its transcriptional activity.

Conversely, a specific pattern of cytosine methylation across the ZmFIE1
gene was found. CCGG sites within CpG island 1 and exon 7 are methylated at
both cytosines, because HpaII- and MspI-digested DNAs are amplified effectively.
This pattern of cytosine methylation is present in all tissues tested (leaves,
embryos, endosperms). But CpG island 2, which is located in the downstream
portion of the gene, is methylated very weakly in embryo and leaf DNA, and its methylation is
barely detectable by PCR in the endosperm. The results clearly demonstrate that
there is a gradient of cytosine methylation along the ZmFIE1 gene, which is heavily
methylated at the 5' end and unmethylated at the 3' end. DNA
methylation of the ZmFIE1 gene correlates well with the repressed status of this
gene in all maize tissues except the endosperm. As was shown previously, only the
maternally transmitted ZmFie1 allele is expressed in the endosperm; the maternally
transmitted ZmFie1 allele must therefore be demethylated in the endosperm DNA.






Maternally derived fie1 alleles are demethylated in the endosperm

The status of cytosine methylation of the maternally and paternally transmitted
ZmFIE1 alleles in the endosperm DNA was determined by means of two SNPs
(single nucleotide polymorphisms) present in exon 1 of the B73 and Mo17 inbred lines
(Figure 11). PCR primers were designed around the SNPs and HpaII sites. If both
alleles were methylated at the CCGG sites, the sequences of PCR products would
show traces of both SNPs. If only one allele were methylated at the CCGG sites,
the sequence of PCR products would have SNPs from only one parent.
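
The allele-assignment logic described above might be sketched as follows. The function name `methylated_parents` and the SNP bases used in the example are hypothetical placeholders; the actual B73 and Mo17 polymorphisms are those shown in Figure 11 and are not reproduced here.

```python
def methylated_parents(snp_alleles, observed):
    """Infer which parental alleles survived digestion.

    snp_alleles maps a parent name to the bases it carries at each SNP.
    observed lists, for each SNP position, the set of bases seen in the
    chromatogram of the PCR product from digested DNA. A parental allele
    is called methylated (protected from digestion) when its base is
    present at every SNP position.
    """
    return {
        parent
        for parent, bases in snp_alleles.items()
        if all(base in seen for base, seen in zip(bases, observed))
    }


# Placeholder genotypes for illustration only.
alleles = {"B73": ("A", "C"), "Mo17": ("G", "T")}
mixed_trace = [{"A", "G"}, {"C", "T"}]   # both alleles amplified
single_trace = [{"G"}, {"T"}]            # only one allele amplified
```

With the placeholder genotypes, a mixed trace assigns methylation to both parents, while a single trace assigns it to the one parent whose bases are present.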

To facilitate direct sequencing of PCR products, ZmFIE1 gene-specific
primers were extended with T3 and T7 primer sequences at their 5' ends. DNA isolated from
embryos and endosperms of the B73 and Mo17 reciprocal crosses was digested
to completion with HpaII and MspI enzymes. The digested DNA was amplified by
PCR, and the fragments were sequenced with T3 and T7 primers.
Chromatograms of the nucleotide traces of PCR products from embryo DNAs
showed a mixture of SNPs from both parents, B73 and Mo17. This is strong
evidence that both parental alleles are methylated in the embryo. Conversely, the
chromatograms of PCR products generated from the endosperm DNA showed SNPs
from the paternally transmitted alleles and a complete absence of traces from the
maternally transmitted alleles. Undigested DNAs, used as a control, showed a
mixture of traces from both parents.

Discussion 

This indicates that the ZmFie1 paternal allele remains methylated in the
endosperm, but the maternal allele undergoes demethylation followed by
transcriptional activation. The data suggest that the methylated state is the default for
the FIE1 gene; thus transcriptional activation of the maternal fie1 complements is
achieved through demethylation. The paternal allele remains methylated and
transcriptionally inactive during endosperm development. Maternal-specific
demethylation explains the mechanism of imprinting of the ZmFIE1 gene. It is very
likely that demethylation of the maternal genes takes place in the central cell of
the female gametophytes before fertilization.

All publications and patent applications mentioned in the specification are
indicative of the level of those skilled in the art to which this invention pertains. All
publications and patent applications are herein incorporated by reference to the
same extent as if each individual publication or patent application was specifically
and individually indicated to be incorporated by reference.

Although the foregoing invention has been described in some detail by way 
of illustration and example for purposes of clarity of understanding, it will be

WE CLAIM:

1. A method of identifying imprinted genes in a plant, comprising identification
of two or more CpG islands located partially or completely within the coding
region.

2. The method of Claim 1 wherein said plant is of the species Zea mays.

3. A method of identifying plant genes involved in endosperm development,
comprising identification of two or more CpG islands located partially or completely
within the coding region of said genes.

4. A method of silencing paternally-transmitted alleles of a plant gene,
comprising transformation of a plant with a construct comprising at least two CpG
islands operably linked to the coding sequence of the gene of interest and a
promoter that drives expression in plants.

5. A method of detecting cytosine methylation in a polynucleotide of interest,
comprising:
(a) Restriction of said polynucleotide with methylation-sensitive
restriction enzymes, followed by
(b) PCR amplification using primers positioned across the restriction
sites for the methylation-sensitive enzymes wherein PCR
amplification of digested DNA will occur only where methylation
protects the polynucleotide from restriction.

6. A method of controlling plant gene expression in the endosperm,
comprising demethylation of CpG islands in the allele contributed by the female
parent.

7. The method of Claim 6, wherein the plant is of the species Zea mays.

Hi^HGTGCCGTTTGGTTCACATATTTGTAACGTAATGGGTAACAGATA 
ACGTTAAATCATGTTTGTTTTATTTCAACCGTAATCAGATACCACATTA 
AAATTTGATACCAGACTATTCAAATTTGTTAACGCCAGTAATCGAGCGC 
AAACCATTACCATTTGCGXTACATTTTTTGAACCAAACAGCACflEiigB

>ZmFIE-1 5' upstream



SQ Sequence 5506 BP; 1538 A; 1215 C; 1048 G; 1703 T; 2 other;

CCGATCATTC GTTTGTTCGA TCATTTGATC GTTCATCGTT CGTTCATAGT TCCTATTCAT 60 

CGTTCATCGT TTGTTCATAG TACTTATTCA TCGTTCATCG TTCGTTCATA GTTCCTATTC 120 

ATCGTTCATC GTTACTATTC ATCGACACTA TTCACCATCG TTACTATTCA TTGTTACTAT 180 

TTACCGGCTC TATTCGTCAT CGTTACTATT CATCGTTGCT ATTTATGGTA GCTTTTTCGT 240 

TGTTACTATT CATCGATCAT CCGATCGCCC CAAATTTCAA CTACTCATCC ATCATGTTGT 300 

CCAGTCCAGC TAAGACCAGC CAGACCCATA TTCCAGTCAT ACGAACTCCT GTGATTGTGA 360 

TTTTCCTTCC AGTAGGGAAC CTCCCATCTG GTCACCCATC CTAGGTTTCT CGAAGTTGAG 420 

CATGCTTAAC TTTGAGATTC CTTTGAACCA GGCTTCCAAA CTCAGATTCC AATAATTCTT 480

GTTTCTAAAT TCTTATCAAA CTATTCCCTA TCCAACCATG TCATCCCTTA AGCCTGGTCC 540 

ATATTCCAGA AAACTCCCAA AATACTCTTG TCCCATATTC TGCATATAAC TCTCCTGTTC 600 

ATACTAAGTC AGACGATTCA TTCGTCACTA TTCTCACCAA CAGTGAACTT CACTGTGCTA 660 

CACCACATAC ACTCAGCTAT AAATACACCC AGCTACCCTC TCCCTCTCCA CACACACTCA 720 

ACACCCTCAG CCAAGGCAAA CACCTCACCC ACTCAGTTAC TCCGCTCTAC CGGCTACACG 780 

CATAGTGTCG CTTCGCCTCC AGTCCACCCT CCTGGTAAGC ACCTCCGCTC CACCACCAGT 840 

AATATCACAA CACCACATGA CACAGATTCT ACTCAAGACT CTACCCATCC ATATATCGCT 900

ATTCTGACCA CTATACTTU^ TATTTGTTGG TATACTTGCT GGTTTGTATG TTTGCTTGTT 960 

CATGTTGCAT AGTTATCGGA GCGTTCGTGC CATCACGTGG AGGCCAGATC TGCAAGTCTA 1020 

CGCCAGGCGG TGGAGCCAGA AGCCAGTTCC GCGAGCTCTC CTTCCCCCTT CACTGGATAA 1080 

GCACAGCAAG CTCACTGGAT CCCTTTGATG CATAAATTAC CTATGATTTT TCAACCACAA 1140 

CCCTCAGCCT GTTATTTTAT GCATAATATG ATTTTGAGAC AAGTTATTAT GGCCACCCAG 1200 

CCGCTTGTCG CAATCAATCC TTGATATATT TGTTACAAAT GATTTGAGAA AAGGTGTGAG 1260 

TTTTCAAAAG AAAATGCTTT TCAAAATGTG TATGATGAAG GGTTTTCACC CTTATCACCT 1320 

TTTAATAGGG ATGATCAAGG ACTCCCTGGT TTAGGGGAGG GCCTAAGGTG ATGGCTCAGC 1380 

TGGTTTAGGT GTGAGCAGAA GGATTGTCCC CTCACATAAG GACCGATTTG TCATCCGTCA 1440 

CTACCTGTAC TCATGATAAG TACAACCACT CGAGACTGTA TGGGCAATCA CTCAATCTGA 1500 

ACTCGTACGG TCCAACCCTA GGGTTATGAA GGCTGGGGAG CACCGGGAGG ATAAGGAGGG 1560 

AGAATGTTTT GTCCGGTTTG GACATGGCGG TGGCCTGACT CCTTCCGGTA TAACCGTTAA 1620 

GGtAAGGACG TGCGAGGAAA GAAAGAGATC CGGCATTCGG GCCTCACGAC GGTGAGATCG 1680 

CAGAAACCAG ACTAGTGGGT AAAGTGTACC CCTCTGCGCA GAGTTTGAAA ACCTATTCGA 1740 

ATAGTCTGTG TCCACAGGAA TGGACGAGTC TGGTGTGGTA TGACAATTAG TGTTTTGTTT 1800 

TCAAAAAAGA ATGTGCGTTT GAGAAAAGTG GTTTTTAAAA GGTCCGGCGG TTGAGCCGTG 1860 

AGCTATGGTG GACGGGAAGT CCAGTAGCTG TTTTTGAAAA CGAAAACCAG TGGGAAACTG 1920 

CTGAGATACC TGGATGGTTT AGTCCAGGGG ATTTTGTTCT AATATTGAAA AAAAATTCTT 1980 

GCTCCTTTGG GAGAGGATGC GCTTTGC7VAA ATACAAAATG TTTTACAAAA TAACCCTGCA 2040 

TAAAATATTG TTGTTTCTGC AAAATATCCT GAGCTCCACA TATTCCATGC ATTATATCTG 2100 

ATTTCCCCAT TCCGCGGGTG ATGGTGGGCT GCTGAGTACG TTTGTACTCA CCCTTGCTTA 2160 

TTTGTTGTTT TTCAAAAAAA GGAGATCGGG TAAGAGTTAC GACTGTTCCC AACCTTGCCT 2220 

GTGGTTGTTG GACCGCTGAT TTGCTTCGCT GCGTATATCG GGCTGCTTCA TCCCCACTCT 2280 

GATGATATGT CCCAAGTTGT GGACCAACTC TTAAAGTTGA TCGCCACCTT TATAGGTTTG 2340 

TCTCGTTTAA GCAGATCTGG AATCATTTGA TGTATAAATG TGTTTACTAG CCTCCTGGGA 2400 

CTAGTAATTG TATCACATTT GAGTCCTAGA GGATCGGGAC GCTTCAATGA TCAATGGGTG 2460 

GATCACAATA GTCGGTTATA ATGGCTATAT CAACAGTTAT AATCACATTA AATGTGTCAT 2520

CAGATGTTAG ATAAAGTCTG TCGTGGATGA TCTGTTTGTG CTTCTCGACG GTCCATGAGT 2580 

GACGCTAAAA TTCATTTTAC CAAACCTAGC ACCTTCGAGT TGGTCTGATC TTGAATAGTC 2640 

AGACX3GTTCA CGACTGAGGT TGAACGATCC ACGCAAGGTG TTGGACGATA CTTT CT T rrr 2700 

C nTG GATGC TCCXSTAGTAG ATGTGTCGGT TTTGACATAG TTCCTGTCCG AACTCCATAC 2760 

AGTCCATAGT AGATGTGTCG GTTTTGGTAC TCTAGACGGC CCGAGTCAGG GGTCTGGACA 2820 

GTCCTGGACT TGCTGAGTTG AGGTTTGATC TTTCTTTAGT TATTTCTTAC ATACCTATGT 2880 

TCATACACTT AGCAAACTAG TTAGCTTCAC CAAAACAAGT GTGGAAAAAG GTTTTTAGGC 2940 

CAATTTCCCT TTCACCTTTA TAACTACCTA GTTACAAAGT AGAGTTTGAT AGTCCCTAAG 3000 

TATGTCAATT CACATCTTGA GTACATGCGA CAATCTCATG TCTAAGGATA CATGGTACAG 3060 

GTTGCAAGAA GAAAATTGTC ACAATATCTC ATGTTGGGTC AGTACAGACT CATGTCATAC 3120 

ATGCACCCAT ATTATTAGTT TTACATCTCC ATGTCCATGA CTTACGAAAC ATAGTCATCA 3180 

ACTAATACAT ATGATAGTCA TTGACTCTAA CTAGGGACAT CTTCTAGAAC AACCATACAA 3240

GAAAAQAGTC TCACAAACAA TTCACATAAT TGCTAATCAA TACAAGGTGT CCTTCACAGA 3300

TATTCAATTA AACAATATAT CATGGATGCA ACAWAATATG CTCATCTCTA TGATTATCTC 3360

TAGGGCATAT TTCTAACACA ATGACAT6TC TAAGTGTAGT ATGTCAAAAC ATGGATAGTA 3420

ATATAGATGG TAAGAGGTCA TTTTTATTAA TATAATTAAC AAAGATAGAT AGGGTGACCA 3480

ATTTTGTAAA AGCACCATTC ATAGACTTTT AGTGGGAGGT GGATGCTCTA CCCGCCTCCG 3540

TAAAGCCAAA GTGGTTGCAT GCAAATTGYT AGGATATAGT AATGCAAGGA ACCAAGCTAA 3600

GGCATGTAAG TGAAACCCAA ACAAGAAGTT AAGAAGCTTC CAAAATGAAC AAAGTACAAG 3660

AATGAAGCTA AAAGAGAAAC TTTCAGCCTT CtCCAATCTC CAGCAAGATC CCTTCGATAG 3720

ATGGTATCTA ATTTTTTCCT ACTATGAAAA CGTATATCAC CTAGTAGAAT AGAGGACAAA 3780

GCTTACGCCT ACTATATATA TCCAATATGT ATAGTTAGAT ACTAAGTTCT TTTTTCTCTT 3840

CTCTTCATTC ACTTTTCAAC TAGGTTTGGA ATTAAGTTTT TGGATTGGCA TAGACAATGG 3900

CATGGTTGTA TAGGTGTTCT TAACCATCAC AGTTATGAGT TTGACTTGTT TTTTATATTC 3960

AAGTTACAAG GTCATTTTGT GCTAGCCACA GCCTAGCAAT CGAGGGGCTA CACATGTGGA 4020

TTAAGGACAA GGCCCAACCC ATGTACGATC CAAGGACACC CTTGTAATTT TTATACTCAT 4080

CAAGGATTAG GGGGAAATAA CTCCCTTCTA TATAAAGGTC TTTCCACTTT 6CTTCTCACT 4140

CTCCCTTATT AGGTTAAACA CAAAATGTGC ATCGCCGCCG CCACCATATA GAACCACTTA 4200

TCACGAACCG CCGCCATCAC ATCCACTGCC TCAACTAGTG TTACCACCTA TQGTTCATTG 4260

TTGTGTCTGC TTCTTGTAGC ACTGTTGGTC TACAAACATT CATATTTCTC TCAACATCTG 4320

GCACAGGTAA GCCCATAAGC CCTAACCCTA GATCTCCATA TTTAGTTATT TCAGTTCTTG 4380

ATGAGCAAAT ATGAAACTAA ATTAGTTTGC TAATAAGAAA TTTAACTACT TTTCCTCTTG 4440

AAGACCTCCT ATCCCTATAT GAACCCACAT CCAAAACCCC TCTAGCAAAG TGTGGCTAGC 4500

TTTCCCATGC CATGAACCTT CAACAATGAT AGTATCAGTA ATGCACTTCC ATAAAAGGGT 4560

TCATATTTAA TTTTAGTTTT TCTTTTTGGT GTTTTAATTA AGCTTTGAGA CTTGATTTGA 4620

AGTATTAAAT AAACCCTTCA AATTTCTTTC TAACTTTGAT AATACACTAT TCAATGACAA 4680

TGCACTTCCT TAAATCCCTA TACTTCACAG CATGCCGCCT TCCAAAGCAC GCCGAAAGAG 4740

GTCACTTCGT GATATCACTG CCACCGTTGC CACTGGGCCT GTTGCCAACT CGAAACCTGG 4800

CTCATCATCG ACGAACGAGG GGAAGCAACA TGACAAGAAA AAGGAGGGTC CACAGGAACC 4860

GGACATCCCA CCATTACCGC CGGTGGTGGT GAATATAGTC CCACGACAAG GATTAGGATG 4920

TGAAGTAGTG GAAGGGCTAC TCGTGCCTAG TCGGAAGCGA GAGTACAAGC CCAATAGCAA 4980

GTATACTGTG GGAAATCACC CGATCTATGC CATCGGGTTC AATTTCATTG ACATGCGCTA 5040

CTATGATGTC TTTGCCATCG CCAGTTGCAA TAGTGTAAGC AACCGACTTC TCCCTACCTC 5100

TTGTTTGCTA TCCTTTTATC CTATTGAGGT TTGGGGAGTT CTATATGGTG AACGAAAATG 5160

GAAGTTATGA TTTTGGTGGG ATTGGATCTT GGTTTATAAC TAGAAAAGGA TTTGAGTACA 5220

GGTTATGATG TGTGGCTTTA TGGTAGGGAA ACTTAATATC TTTTCCTATT TTGTTTTTTG 5280

GCATCACGAG TAATGGTTTG GGAAATAAAA GGGAAAATGA TTTAAAATTA TTTCTCAATA 5340

GAGCATGCCC TTTTACATAG GGACATTTTA GTCATTTTAC ACACACTTTA GTCATTTTAC 5400

ACACCGTAAT TATGTCACAA TCAAAGAATC ATTCCTTGGT TCAATTGAAT GAGATGATTC 5460

AACTAGTTCA CATCTCTATA CCTAACAATA TAQTTTTTCA TAACTA 5506

>ZmFie1 coding region

AAGCTTTGAG ACTTGATTTG AAGTATTAAA TAAACCCTTC AAATTTCTTT CTAACTTTGA 60

TAATACACTA TTCAATGACA ATGCACTTCC TTAAATCCCT ATACTTCACA GCATGCCGCC 120

TTCCAAAGCA CGCCGAAAGA GGTCACTTCG TGATATCACT GCCACCGTTG CCACTGGGCC 180

TGTTGCCAAC TCGAAACCTG GCTCATCATC GACGAACGAG GGGAAGCAAC ATGACAAGAA 240

AAAGGAGGGT CCACAGGAAC CGGACATCCC ACCATTACCG CCGGTGGTGG TGAATATAGT 300

CCCACGACAA GGATTAGGAT GTGAAGTAGT GGAAGGGCTA CTCGTGCCTA GTCGGAAGCG 360

AGAGTACAAG CCCAATAGCA AGTATACTGT GGGAAATCAC CCGATCTATG CCATCGGGTT 420

CAATTTCATT GACATGCGCT ACTATGATGT CTTTGCCATC GCCAGTTGCA ATAGTGTAAG 480

CAACCGACTT CTCCCTACCT CTTGTTTGCT ATCCATTTAT CCTATTGAGG TTTGGGGAGT 540

TCTATATGGT GAACGAAAAT GGAAGTTATG ATTTTGGTGG GATTGGATCT TGGTTTATAA 600

CTAGAAAAGG ATTTGAGTAC AGGTTATGAT GTGTGGCTTT ATGGTAGGGA AACTTAATAT 660

CTTTTCCTAT TTTGTTTTTT GGCATCACGA GTAATGGTTT GGGAAATAAA AGGGAAAATG 720

ATTTAAAATT ATTTCTCAAT AGAGCATGCC CTTTTACATA GGGACATTTT AGTCATTTTA 780

CACAGACTTT AGTCATTTTA CACACCGTAA TTATGTCACA ATCAAAGAAT CATTCCTTGG 840

TTCAATTGAA TGAGATGATT CAACTAGTTC ACATCTCTAT ACCTAACAAT ATAGTTTTTC 900

ATAACTACAA TTCTTAAAAA GAATTAATAT GAACCTAAAT ATTATTTCAC TTTCTTGCCC 960

CTTATAATAT AATACATTTG TCACTCCCAT TTTGGCAAGG GTGGTGGGTA TTTTGGGGGA 1020

TGGAATGTTA CTATTTTTAA TTTGATTAGA AGCTATAAGC TTTGGCTATA TTTTTATTAG 1080

GAATTTGATG TTCATTTTCA ATATATTGTG ATCTATTTTC TTAAAATGTG AATTTGTTGT 1140

GTATTTTGAT TAGTTCGATG AAGAGTGTTT ATAAGATATG ATTTTTAAAT TCTCTTACGA 1200 

CGAAACAATA TTATGTTACT TTCATCTATT CATCTTGAGQ AATCACCTAC CTCACTTCTT 1260 

GATCTTGCAG GTGATAATTT ACCGATGCCT TGAGAATGGT GGTTTTGGTC TTCTACAAAA 1320 

TTATGTTGAT GAGGATGTGA GAAAGACAAT GCCTGGTGCA TGTGGTTGTT AATGTTAATT 1380 

TGATAATATG CTTTTATCTA ATGTCTGTGG TGCCTATTTA TCTCAGAAGG ATGAGTCATT 1440

CTACACTCTA AGCTGGACCA TCGATCAAGT TGATAGCTCA CCGCTGTTGG TGGCCGCTGG 1500 

AAQCAATCGG ATCATTCGGG TCATCAATTG TGCTACCGAA AAGTTAGATA AGGTCCCTGC 1560 

CCCTGTGCTT ACTCTATOTT TGTATGGAAA AGTTGATTGA ACGTTGATGT TCACATATCA 1620 

ATATTTCAGT AGTTTAGTTG AAATACAATT TATTTATGCT CTCTATTCTT GAACATCAGT 1680 

TGACTTTGCT TTGATTAAGC AATGGTCTTG CTCATACAAT ATTCTAGGAG TTGAATATTC 1740 

AATATGCCTG TTACATGATA GCAAATACAT AGTGAACTAG GACATGTACT AAATATTTAA 1800 

TTTCCCTTTA TGACATTCTC TAGAGCTTAG TTGGCCATGG TGGTTCAATA CATGAGATAA 1860 

GGACTCATGC CTCGAAGCCA TCACTCATCA TTTCTGCCAG CAAGGTTAGT AATAAATTTG 1920 

TCGTGTGTCG ATTTTTTTAC ACTTTTTAAC ATGACATTAT TCTATAGGAT GAATCTATTA 1980 

GGCTAtGGAA TGTCCATACT GGGATTTGCA TCTTAGTCTT TGCAGGQGCT GGAGGCXATC 2040 

GACATGATGT GTTGAGTGTT GTAAGTATCG ATTGCATCTT GTCTAGACAT TGTTTTAAAT 2100 

ATCACTTGCC CCGAAGATAA CACTCATTAO AATTCT7UVTG. TTACCATTTG TTATTGAGCA 2160 

TGCCAAATTT CAATTTTAAC ATCATAGATA AAATAAGACC CCACAATTAC TTTTACTGTT 2220 

TATGTACTTC CATTACATTA GGCATAAAGT TACTGATAAA AAAGACAATC TTTTATCTGA 2280 

AGGACTTCCA CCCTACCGAG GTTGGGATTT TTGCAAGTTG TGGCATGGAC AATACTGTGA 2340 

AGATTTGGTC AATGAAAGGT TTGGGAACTA CTTTAAACTA GCTTCATGTT TACATTTTGT 2400 

GTTGTATGTT GCATATCATC GACAAATATT GCCAATGTTG TCACAGAATT TTGGATATAT 2460 

GTTGAAAAAT CATATTCATG GACTGGCCAT CCATCAAAGT TTCCAACGAG GAATATCCAG 2520 

TTTCCGGTAT GTTAAGTAGC TATAATCACC TGAGCTCCTT TCTTTTTTTG CAAACTATTG 2580 

TTGGTGTTCA GTTTTCATGC CATTCAAGCA TACATGTTTC TTTTCTTTTA GGTCTTGACT 2640 

GCTGCAGTAC ACTCTGACTA TGTTGATTGT ACCAAGATGG CTTGGTGACT TCATCCTATC 2700 

AAAAGGTAAA TTCTTCATTT GTTAAATGGC TATACATTTT TTTATAAAGG AAATTTTTTA 2760 

TTAATTTCAA GCACTTTAGA TTGAAATAAT ACAAAATCTT AAAAAACATT TTTGGCCTCC 2820 

ATTTAAACAA GCACAAATCC AACAAAAATG AGTAAACCAA CCCATTCTAG TGAATATTAA 2880 

TGCATAAACT AGATTGCTAC CCATATGTCT AGAAAAAGTA GCCTTGACCG CGTATCTTTU^ 2940 

TTGTCACCAT GCCGCCACAA CCAAACCGTG CAAATATGGT TTTTGGAGAA TGGACCAAGT 3000

AAGAAACCAA TCAATAATTG AGTATATAGC ATGCACAGGA GAAATAGATC TCTTATTTTC 3060 

AAGAACAATG GTATTTTTTA TTAACCATAG GACCAACAAG TAGCGACTAC CCATAGCAAA 3120 

ACTAATGGCT TCAGATTATT ACTGGTTGTT GAAGTGTATA CGTGGTTTGC CTACTTTCTC 3180 

CCAATAGTTT AAGCTTTTGG ATTGAATCGA TTAGTGCOTT CACTCTTACA TGGTATCAAA 3240 

GTTAGCAATT TTGGGTTTGA ATCCTAACGG AAGCTTTATT TGTGACTTCA CCTCTTGTTT 3300 

TCCATTTCCT TTCTACCTGC ACGTGAGTGG GGGTGTTGAA GTGTATAAGT GGATTGCCTA 3360 

CCTTATCAAC CTTTTGGATT AAACTGGTTA TTGGTTAGTG TGTTCACTCC TACACCTAAG 3420 

TATGAGGTTT AGTTATCCAG TAGCCAATTA GATTATGCAC AGTGGACACT TCACATGTGC 3480

AACTAGCACT CAAAACATAA GTCTTTAATT GTCTCATCTT ATGACAAAAC AACATATTTC 3540 

ACTACCATTC TATAACATCT TGATTTGTAC ATCAGTCTTG TTAATGCTAA ATAGTGAGAT 3600 

TTGATCGTCA ATTGGCCAGT TGGATGTAAA TTCCAGTGAA ATACATCTTG ACCTTGGGTT 3660 

AAATGGACAT TAGCAATGTG TGGGAACAAA TTGTTGGTTT GGGTACACCA AACTGTTGGT 3720 

TTTTAATTAG TAGATTAGTT TGTAACACAT TTCCTTTTAT CAGTGTTAGT ATTGGTTTAT 3780 

TATGCATAGG GAAGGATCTG ATATGTGATA ATTAACATQG ATTTGCAGAG TGTAAAGAAT 3840

GCAGTTTTGC TTTGGGAACC AAAACCAGAC AAGCGTAGGC CTGGGGAGGT GACACGCTTT 3900 

ACCTTCTCGT CCCGAATTCT GCACCTATTT TTATATTACT ATCATACTCA TCTACAGTTT 3960 

AAAACTTGTC CCGCAATCTT TTCAGTTTCT GAGCACTAAA TTTATACCTC TGAATCAGTA 4020 

TAGTCGTTTT CTCTTTGTTC GTATAGGGGA GTGTTGATGT TCTTCAGAAG TACCCGGTGC 4080 

CAAAGTGTTC ATTATGGTTT ATGAAATTTT CATGTGATTT TTACTCCAAC CAGATGGCAA 4140

TAGGTAATGC CTTTAATTTT GTGAAGACTG TTTTGGCACT AAAGCTTTAC GTACGTAATA 4200 

TTAGTTTTAT ATCTTGTACA TTGATGGAAA ATAGATTGCT CAATATCTAT ATATATGACT 4260 

ATATCTTGGG TTAGATTCTA AGGAACAAAC TCTCCCAGAG TACX3GTTCTG AATAACAACC 4320 

ATCTGCTGCT GCTGCTTAAT GCGAACAGGC AACAATAAAG GOGAGATCTA TGTCTGGGAA 4380 

GTGCAGTCCA GCCCGCCCGT CTTAATTGAC CGGTAAATTT CCAGTTCTTC TCCTCCTCQC 4440 

ATCGGTTCCT GCATGGGTAG CTAGCTAGTA ACTCCGACGC TTCTGCTGGA TGCAAACACT 4500

TGTGCATTTT CAGGCTGTGC AACCAGGAAT GCAAGTCGCC GATAAGGCAG ACCGCAGTGT 4560

CATTCGACGG AAGGCACGTA CGCACTACGA CTCTCACTAT CTGCTCATGC ATGCATTCAC 4620

CGCACGTACG TGTGATGTGC TCGCTCGCTT CCTCCTTTTG TGATGGTGTC TCTCTCACTT 4680 

GCCCAGCACG ATCTTGGAGC CGCCGACGAC GGCGGATCTG GCGCGGTGGG ACGAAGTGGA 4740 

CCCTGCTGCT TCCAGCTCCA AACCTGATCA AGCTGCTGCG CCCGCCGCXTG GTGCGGQTGC 4800 

CGACGCCGAC GCCGACGCCT GAGC6AGAGG ACCGTCGTCG CCCGCCGGTT CACATCGATC 4860

GTACTCCGTQ CTGGCTGATT ACCTTTACCC ATTGGGATGT TTTGGTTCAG AGTCGCCAGA 4920

TCTAGTGTGT GGCTGAACGT TGAATGTTAG GATGCTGCTG CTTGTTATGC TCTGAGtCTT 4980 

GAGTTCTCTT TGTTAATTTG CACCGTGGAT GAGATGAATA ACTTGACGTT GCAACrTTOC 5040

ATCCCATATA TGCCGTAAAT CTGCCGTCTG TTGTTTGTTC TGCGTTGTCT AGAATTAQTG 5100

GAGATGTGCT GGATACAATO TATGCTAGTC TATTAAACCG TGCTCCACTC TGAGATAATC 5160

GACCAACTTG TCTTATTATT GAAAGAACTG TGGAAAAAAC CAAAAAAAGT CGTTGTGGTT 5220 

TTGTTTATTA TCAAATATAT TTTACATAAG ACTTAAAAGT TTTCATTTTT TCATGAATTT 5280 

TTTGAATAAA CCGAGTAGTC AAAGCTAGGG TCAAAAAGGC AAACATATTA TATTTTAAAA 5340

TGGAGAGAGA GTACATTGTT TTAAGACGAA TTGTTTAATA CAACTCGAGA ATATTCTGAT 5400 

ACATTAATCC TATGATATTA CCATAAAAAA CATTAATCCT ATGATAGAGT GTATAATTAC 5460 

AAATGCACAA AGGTTCTTTT CATGTGAAAT CGTATTATAG ATAGGGGTCA TAGCGCGCCC 5520

TTGTCCCTAC AACTTACGAT GTTCATGAGT TAGQTTAGAA AAAGGTTAGA GCAAGTATAC 5580 

TAAAGTGACA TATGCAGGCT ACAAGGAATG CCACATCAGA TTTTTGGTGA CGTTQAAGGA 5640 

AGAAAAATAG AGGGAGAAAA AAGCGAACCA ATTGCGAAGG TGCCTTCTTC CAAGGGCACG 5700 

GTCCATGGAG TGTGGTAGCC GACATCAAGG TAGAGGATTA TGGTAAAGTT ATTTGAGCAA 5760 

GTGTCTGACA ACTAGCATGA AGGCTTAGGA TTTTCTAAAT GCATCTTTGA GCGCTATTGA 5820 

TGTAGATGTT AATGATTTTT AGGGCTGATG ACCAAACCAA AGATGAACAT GGGAACGNAA 5880 

GGAAGGTTAC TGAAAGTGTA TAGGCCCCTA GTTTAGTCTT CAGTGACTAA TGATAATATA 5940 

TATTATTGTG ACTAACAAGT GTTTTATAGA AACANGGAAA GTTAGATCAC AATAATAGAT 6000 

ATGATCAGGA TTATTATGTG GTACCCATCC CTTATTGATG AAAATCAATG GTTGGTTCTC 6060 

ATAGGATAAT CGAAAAGGTT AAGGATCAAC TGTAAATGGA GTTGTTGGAC ACTTAGAGTA 6120 

GTGATTTGAC CTTTTTTCTT TGGTAGTACT ATAAACGGAC ATGAAATGCG TAGCTTTACC 6180 

TAAACAAGTC TAGTTAAGTA TGATGATGCA CACTTGTGAA TACTAGTGCT AGGTAAACCC 6240

ATGAGATCTC ATGTGAAGTT CGAAACAAAA CCTAATTCGA AAAGTGATTA AAACATGTGA 6300 

CTTAACAATG TTGTAGTAGC ATTGGTCGAG TTTGATGGGC ACCTGATATG GGTCACTAGA 6360 

CATGAGTGTG CCCTGTTGTG TTTGAGTGAA GCACTAGCAT ATCAGGTGTG CAACAGATAT 6420 

GGTGCACCCA GGCAGGACAC CCAAAGAGCT TGCAAAATTA GCCTAAAACA CTTAGTGCTC 6480 

ACCAGACATA TCTAGTGTAC TACTAGTTAT TCTCGTTATA TATGAACCCT ATTAGTTATT 6540

CTTGAATTGC TTCGATCTTT TACAAAGGAA GTAGTTTTTC CTTCATCTCC ATAAACTGTG 6600 

GTTTTCCAAA GGCATTAATA ATAAGATTTA GTATATTAAA TTCAAAGTTG AGGTACTTTA 6660 

TTATCGTGAA ACCAACATTA ATACTATAGA CTTAACTAAG GAGTCTATTG GTGCTTCCTT 6720 

CTCATGTATT TTCTTCTTGA AGTGTTCCTT CATCTTGGTG CTAACGACGA CATTCAACAA 6780 

TGTGTGCTCT TACTTGATT6 GTTTGTATAT ATGGTGGTGT TCCTTTACTT AGTGGCAACA 6840 

TACCTTATCG ATAACTAACC CTTAGTGAAA GAAATGAAAA TGTACATCCC ACTGGGAAAT 6900 

CACTCATACC CCTAAGAGCT AACTTAATGG AACATCACTC ATAGCCCTAA GGGCTAGTTG 6960 

GAAGTACTTT CTCATTTCCT GTATAAGGGC TAGTTCATGA TTCAACTTCT TCTCCATTTC 7020 

TTGGTGAACT ATCTTAGCAC GATTCCTATA AAAACATATA CAACTAAACA AAGGGTGGTG 7080 

GTACTGAACA CAGTGGACCC AAGCACTCGG AAATGGGAAG GACAAGTTGC ATGGAAAAAA 7140 

CGACAGGCTG GGAACTATTG TGTCTTGTCA AGCGTGTTCG TCCAGCTATA GGACATGGGT 7200 

ATTTATAGGG CAACTAGAGQ TTGGTATCCT AAAATATGTC CAGACCCCTA GTTATCAACT 7260 

ACGTTCCTAG ATAATACTGT ACAACAAGGT AATTATAGAA TAGTAAGTTT GTTATTCTAA 7320 

CTCCACCCCG ACAGGTGGGT CCGTTGTCGC CCGGTTGAGA GTGGGCCCTG CTCGGCCAGG 7380 

TCATTGGCAT TGTCCGTGCA GACGTGTTCC CAATATCGAG GCAATGAAGT TGTTTGACAC 7440 

TTCTTCGGGA GTCGGCGTGA GGCCTTCGCT TGCTAGCGCG AACTTGCCCA CGAGCGTCCT 7500 
CACCATGGGC CCCGCTGACA AGCTT

1 TTTTTCACAC CCTTACTOTC ATCTAACACA ACCAGOTACA AACTTCTTTT TCGTTTTCAA
61 GTC6AATTTT GAGGGGCAAA CCATAGTTGC ACTTCCATCG AGGGACAAAA ACACAATTOC
121 CCCTTAACTT ATATAGTTAA ATATACTTAA CGAGCTTGCT ACTGAGACTA ACAAGTCAAA 
181 ACTATTOOCT TGACCTTATA TTAGTTTTGT CTTACACTTT ACAATCGTTG ATGGCTGCTC 
241 TAGATCTTAT AAACTTAAGA ATATTATGAC TTTATCACTT TATTTGTAAT GGATGTATGG 
301 ATACTCATTG ATGCATTATT TATGGTATAA ACTATAGACC ATGAATGTAT GGTGTAATGC 

361 TATAGTATAT TGTTAGACTT GTGTACATAT ATATTATTTA TACTTAACTC ACAAACTTAA

421 TGAGTCAGCT CGAACTTATA AACGACCTGA GTCGACCTGG CCTTATGGCT TGTTAAGATA

481 ACAAGTCAAA CCAAGCCGAA CTGACTCGTT ATCCAAATCT ACACTTACAT AAACAAAACA

541 TGATTTCAAA TTAAGATTGG TACAAAAGTG TTTTGTTTTA TTCAATTAAA CCCTACACTG

601 TACTCTTTAT GTCAACAATA GTTGATGCTA CQACAAAGCA ATGAACATTT TATGGAGTAG 
661 TTA ATTTrAT TOTCCtAATO TCAATTACTA TtOTTAOCCA AjOOAATGOAG TAAOCCAATA
721 AAOAGTACAT ATCTACQAOO AAATTTAGAT ATGTGCGtAA CTTimAXT CGXOATACAX
781 AATOTOCAAX ATAAOGGTCC ATGTAACATA CATATATTIC TT6TTTTTAX GOXAAAAGAO 
841 TOTATAAACT ATAAAOOTTO TTOCTTAOAA OCGGGATTTA ATAACATOOO TTTTATATTA 
901 ACCTTAA6TC CCTATOCAAT ACCTGTATTT TTTTCTAAGT ACAT60TACA AACACAAATA 
961 CACACATTTA AOCACACATA CTCACTTGCT ATGAGCACAC ACACGTAAAC CCTACTCCTA 
1021 CTAGCACCTT CAAAAGACAA AATAGATAAA TCTTGTTOAC AAAOTCTATT GAAAAATATC 
1081 AACGTCCGGT CTAAATCTTG ACAAAATATT AGCACTTGTG CCAAGTTAAG AAGTGAGCAC 
1141 TTGAACGTAA GTGGTTAGAG GAACCTAACC AAGTTAGTTA TGTTCAATTP TTCATCCAAG 
1201 TTAGCTTGCT AGTTTTTCTA TACACAAACA TTATATTAGC TTATACCATT GTTGGGAAAT 
1261 TCTAACTTTA ATGATTTCTT TGAGAAATCC ATAAGAGCGA TAAAGAGGAG AGAGAGAGAG 
1321 AGCAAGAGAT TTGTACATGT ATAAATACTA TCCATTTTCT ATTTAAGAAT CTAGACAAAC 
1381 TAGCAAATAT AAATTTGAAA CATAATAAAG ATGGGCACCT GGCATCTCCT 6GATATTAAA 
1441 AGCGTACCAT TAAAGATATA CATAATTATT CACCTCTTCT AGGTATAAAT TACCCTACTA 
1501 CCACATTCCC CTATCTCTAC AAACTCTCTC TCATTGACTC ATCAA6AGAG TGCCACCTCT 
1561 ATCTCTCCTT CTCTCTTTTC AAATGTTCTA CAATTATCAA CCATCATACA ACATTCACCT 
1621 TTCCTACCAA CCTTGriGAT GCTTGTCTCA ACTTTCTCTT TACCTAGATC ACTCATATAT 
1681 ATCCCTATTT CAAAGGCATT AATCATCAAA AACCTATAGA AAAATCCCAT TATCAACCAT 
1741 GATGGAGTCT GATCGTGAGA AACAACAGTC TCATGGCAAG AAACAAGGTG ACCATGGTAG 
1801 CAAGATGCAT GATTCTGATG GCAATAAAAA TGTGTCAGAT GAAAAGAGTC AAGAGTCTGG 
1861 TGGTAAGGAA CACAAATCCA ATATAAAGAA ACATGAATCA CGTAGAAAGA GGTAAGACAT 
1921 TCTCCTTGAA AATCTTGGCT TCAAACTCAA GTTAAATTTA TGTACACATG TTTATATAGA 
1981 GTCTAGAGAT TTTGTGCTTA ATATATGCAT GCACATGAGT TCAAATAATT TCATAATAAA 
2041 AATAAAAAAA TCAATATGAT CAGGAATTAA ACCATGAAAT TTTTAGAGAC ATCATCTAGA 
2101 TTGAGTTCCA TGGTCATACC ATGATGGTTA TGTCATTTCT TTCCAATATA AAAAATTCCT 
2161 TAACTTATAC TCAAAATCTT GATTGGATOG AACTTTTTCT ATAGAATTCC TTGCCACATC 
2221 TTGTSTAACA ACCATTTGTA TTGGTTT60G TCTAGTCCAC TTTTGTGTGT TGCTATTATG 
2281 TAAATAATTA TTTTTCAAAT CCAAAGTTGT TCCTCCACAT ATCTAGAATA TATTCTAATT 
2341 CTACAAGAAT TTAAAATGAA TTGTTAACTT AAGAAT6CAT TGTTCAATAT ATTTAT6CAT 
2401 TTTCTCCCAT TATGATATAT ATATTCTCAA TATTTGGCAC ATAATAACTT GGAACATTCC 
2461 TTACATTTGT TGGGTTGA6T GCTATATGTT TGGATTCATT AATTATTTAC ATTGATATTT 
2521 TTGTAGATGT TTGTGTTTAC CCAATAAGAA AAGGCCATTA AGAAAATAAA A1GTTATTAG 
2581 ATAGAGTTAG TCTTGACATG TTATATTCTT TTAATAATTG GATTTTGTGG TATTTCCAAC 
2641 ACATTCCTTC CATTTAAACC TAACTCCATC TCTCTTATCT TCCTCTATCA TATACCTTAT 
2701 CTTCTTTCTA CACTAACACT AATGCTTATG TCACTCCTAA CCTTGATGCA ACCTACCAAT 
2761 AGTCAATTAC TGTTACGTTG CTAGAACCAA AGATTGGTCC ATTGGTGCAC AATCCATTAG 
2821 TTCCTCCTTC TTGGGACTCT TCAACCATCC TAACTCCCCA AATGATTTCA AAAGTTTTCC 
2881 CTACCATGTC ATCCTACTCC ATATCCAA7G TCTACTGGTG CTAGATTCTA TCTACTOTTA 
2941 QCACCAAACT AACCACAAAA TAATAATOCC TACAAATATA GGTGGAOaTO ATGTAAAATT 
3001 AAOOQAGOQG CAATTQTAAA TGOTAOXACC ATAOATATCA AACCTTCTCA ACTTAGAGCT 
3061 ATGTCTACAT AOTTCTAGTC CTATQAAQCA TCAACCATTT TCTTACTAAA CTAAATATTT 
3121 TTAGAGOAAO GQOTGOATCC TTACTTTCAT CTCCATGA6C TTCCACCCCT TCCTATOAGC 
3181 TTATCCATCO ACTGAAAGTT CCTCATTGCT OGAOCTTACC CGTTATTATC CCATGTCATC 
3241 TGACTTTTOT ATGTACTATT ATCTTTOAAO TCGTAGOCAT OTGGTAAATT CCTACCTTAA 
3301 OATCCArrAA TCCTCCAACA CACCCTTAAfl ACCCAAACCA TAACGCCTAA ATCCAATTTC 
3361 AACATATTTT AOGTOACATO GGTATATOTO ATATTAGTTA CTTAATATAO CAAGCTCTAT 
3421 CAATGATTTT TAGTCAGAAA ATGGTTQATA TGTTTT T AGT GGTTGTACTA TAATTGAAGA 
3481 GGCACATAGA GCAAGTTTTT AGACCATGAA TATATG6T6T AAACTATAGA CCATOAATOT 
3541 ATOOTGTAAT OCTATAOTAT ATTAATTATT AOACTTATGG ACATATATAT TATTTATACT 
3601 TAACTCACAA ACTTAATAAO TCAOCTCOAA CTTATAAACC ACCTOAOTOO AACTGOCCTT 
3661 ATO0CTC6TT AAGCTAATAA GTCAAACCAA OTCOAOCTGA TTCATTATCC AAATCTACAC 
3721 TTATGTAAAC AAAACATGAT TTCAAATTAA OATTGGTACA AAA6TQTTCT GTTTTATTCA 
3781 ATTAAACOCr ACACTATACT CCTTATGTCA ACAATAGTTO ATGCTACOAC AAAOCAATQA 
3841 ACATTTTATO OATTAGTTAA TTTTATTATC CTAATGACAA TTACTATTGT CAOCCAAQQA 
3901 ATGQAQTAAO CCAATAAAGA GTACATATCT ATOAGQAAAT TTAGATATGC OTCCAACTTT 



3961 ATTTTTTTAA TCGAGATACA GAATGTGCAA AATAAGGGTC CATGTAACAT ACATATATTT 
4021 CTTCTTTTTA TGGTAAAGGA GTGTATAAAC TATAAAGGTT GTTGCTTAGA AGCGGGATTT 
4081 TAATAACATC AATTTTATAT TAACCTTAAG CCCCTATCCA ATACATGTAT TTTATTTCTA 
4141 AGTACCTGGT ACAAGCATAA ATACACACAT TTAAGCACAC ATACTCACTT GTTATGAGCA 
4201 CACACGTAAA CCCTACTCCT ACTAGCACCT TCAAAAGACA AAACAGATAG ATCTTGTTGA 
4261 CAAAGTCTAT TT ATQGTATA AACTATATAC CATOAATOTA TGGTOTAATO CTATAOTATA 
4321 TTGTTAOACT TGTGTACATA TATATTATTT ATACTTAACT CACAAACTTA ATAAGTCAGC 
4381 TCGAACTTAT AAACOACCCG AGTC0AACT6 GCCTTATGGC TCOTTAAGAT AACAAGTCAA 
4441 ACCAAGCCOA GCTOACTCAT TATCOUUkTC TACACTTATA TAAACAAAAC ATGATTTCAX 
4501 ATTAAGATTG GTACAAAAOT GTTCTATm ATTCAATTAA ACCCTACACT ATACACCTTA 
4561 TGTCAACATT AGTTGATGCT ACOACAAAOC AATOAACATT TTATOOATTA OTTQATGCTA 
4621 CAACAAAGTA TATT0TTA6A CTTGCTAOAT TCTATCTACT OTTAGCACCA AACTAAGCAC 
4681 AAAATAACAA TCCCTATAAC TATAGOTGOA GGTGATGTAA AATTAAGOOA OOGOCAATTO 
4741 TATATGQTAO TACCATAGAT ATCAAACCTT CTCAACTTXO AGCTATOTCT ACATAOTTCT 
4801 AGTCCTATOA AGCATCAACC ATTTTCTTAT ACTAAACTAA ATATTTTTAG AGGAAAGGGO 
4861 TGGATCCTTA CTTTCATCTC CATOAOCTTC CACCCCTTCC TATGAGCTTA TCCATCGGTT 
4921 GAAAGTTTCT CATTGCTAGA OCTTACTCOT TATTATCCCA TGCCATCTGA CTTTTGTATA 
4981 TGTACTATTA TCTTTOAAGT CGTAGGCATG TGTAAATTCC CACCTCAAGA GTCAAOATCC 
5041 ATTAATCCTC CAACACACCC TTAAOACCCA AACCATAACA CCTAAATCCA ATTTCAACAT 
5101 ATTTTAGGTO ACATCGOTAT ATGTGATATT AGTrACTTAA TCTAGCAAGC TCTATTAATO 
5161 ATTTTTAGTC AGAAAATQGT TAATATGTTT TTAGTOOTTG TACTATAATT OAAGAGGCAC 
5221 ATAGAGCAAO TTTTTAOTCG TTGTATTCTA AACAATGATT GATGT GTATA AAT TTAATAA 
5281 ATTCATTGT7 GCATCTTGTG TTTCATACAT TTOAAAT6CT TTGTGCCTAA TCTATAT6GA 
5341 TGAAGAAGTA AATCCTTCTA AACTTTTCCT TCCCTGCAAT CTTTrTAAAC ACACTCTAAA 
5401 CCCCAAATAT CTAATCCTAA CCTCTAAACC TGATTTAAAT TTTCTAATCr AGTCCATTTG 
5461 TAGTGCTTTT ATATTTAGTC CATTTGCCTT ATGTGCCTCT TGTG TATAAA TAGCGTAGAG 
5521 TTCTGTATAA TAGTCAACAA GTTTTGCCTT TTGTTGTCGG ATCCATTTTC AATCCTTTTG 
5581 TCTAGTTCAC CTATTGTTGT TGTGAAAAAA ATGTCACACA TTTTTTACTT CCCCCTATAC 
5641 CACATACTCC ATCACGGACT AATGATCTTC AAGGTATGTA TGCTCAGTTT AAATCCATGT 
5701 CTCCACATAC TCCATCTTAA GTTCAAGTCT CTACTTTAAG GTATGTAATT TTAAAACTTT 
5761 GACGTATTGT AATTCTATAA GGAGCAAATC TGAAAATTAA ATAAGGAAAA ACTGGTAAAG 
5821 GCATGTTTGG AAATCGGAAC GCAGACATTT TGTTGTTCCT ATGTTTTTCT TTAAATAAAC 
5881 TCATTCGTGT AAAATTTCTT CAAAATTCCT CTCCTTCGAA CAGATCCTTT TGCCCCCGGA 
5941 CCCCTTTCCT ACGCTTCCCC AAACCCACAA AACCCTCGCC GTCGCGCCGC GCGATTGCCT 
6001 CTCCOGCCGC COCGAGCCCO COACACTAGT AACGGTCTAC ACCAOCAGAA TOACTGAAGA 
6061 ATTGAATTCC AGCAAATTCA A6CTTTT0TT TTAGCCAAGA TTTGAGATTC GATTTGAAOT 
6121 OTGGAAOTCC TTCCAAXTTG CCAATCCTAT ATTTOATCTC TOCTGTGCTO OGTTAAATCC 
6181 CTAAACTTCA CAOCGCGQCO CCGGCCCAOC CACOCCGOAA OAGGTCGCCO 0GTQA0G7CA 
6241 GTGTCCCCGT TGCTGCCGCC TCTAACCCQA AGCCTAGGCX: GCTGCCOGTO CATAACAAOO 
6301 AGAATCAGOC GGAG06GAAA OTAOCAGAGG A06GGGCAGC AACTQAGGAO GGGGAOAAGT 
6361 ACCGGGCGOA ACCG6AAATC TTOCCGCTGC CQCCGGC CAT GG CGAAGCTG G 
>ZmFie2 coding region 

1 AAGCTTTTGT TTTAGCCAAG ATTTCAGATT CGATTTGAAG TGTCGAAGTC 

51 CTTCCAATTT GCCAATCCTA TATTTGATCT CTGCTGTGCT GCGTTAAATC 

101 CCTAAACTTC ACAGCGGGGC GCCGGCCCAG CCAC6CCGGA AGAGGTCGCC 

151 GCGTGAGGTC AGTGTCCCOG TTGCTGCCQC CTCTAACCCG AAGCCTAG6C 

201 CGCTGCCGQT GCATAACAAG GAGAATCAGG CG6AGGGGAA AGTACCAGAG 

251 6AGGGGGCAG CAACTGAGGA GGGGGAGAAG TACCGGGCGG AACCGGAAAT 

301 CTTGCOGCTG. CCGCCGSC CA TQG OGAAGCT OGGCCCXjGGG CA066GCT06 

351 GGTGCGACGC OGCOGAGGGG TCGCTCGTGC CCAGCCGGAA GCGGGAGTAC 

401 CAA6CCCTGC G6CAAGCACA CTGAGGG6AA GCGCCCGCTA TATGCT A TCG 

451 GGTTCAACTT CATGGACGCG CGCTACTACG ACGTCTTCGC CACCGTCGGC 

501 GGCAACCGCG TAAGCCATCG ACTGCTCTCT CCTGTCGTCC TT inT l U TT 

551 TCTACTGAGG TTTGGGGAGT TCTTGTTGAT TAATGGCAAG GTAAAACTAC 

601 GTTGTTTTTT TTTGTGATTT TGGTGGTCGG TTTTAGGAAG CGGTCGCTTT 

651 TGATTCAAAT TTGATCTAAA GCTGAGCCAT TCGGTTGTTT TTATTGGGGA 

701 CTTGAOGTCT GTAATGTTCC CACTATTGTG ATTTCTTTTG CCGAAACATG 

751 GAGTTTGCTA GTTCATnX^A T6AAAAGCTG CAACCTTTGA CAAAGAATTT 

801 GTATCACTTG GGAAAGTATA GTGACGTGTG GGGAATCAGA TAGTACCAAT 
851 ATTACTTTGA CTATGATTAT AACATAATCT TTTAATGTCC TTTGTAACGA 



901 CCATGCTGCT TTTCGCTTAT CTTGCCTATT GATCTTGCAG GTGACAACTT 



951 ACCGCTGCCT TGAGAATGGT AGTTTCGCTC TTCTACAAGC TTACGTTGAT 
1001 GAGGATGTAA GAAAOACAAT OCTCAATGAC AATGCTTTT6 CTTGCTGATT 
1051 TAATATTGAT AATATTCTTT CTCTAAtTCT TGTGA06CCT ATTTACCTCA* 
1101 GAAG6ATGAG TCGTTCTATA CTCTAAGCTG GGCTCGTGAC C^TGTT6ATG 
1151 GCTCACCACT GCTGGTGGCA GCAQGAAGCA ATG6GATCAT TCGGGTCATC 
1201 AATTGTGCTA CAGAAAAGTT AGCTAAGCiTA ATCTACCCTT ATATTTGTAT 
1251 GTGTTCCTAT G6TAAACTTG AATGAAGCCT TATTTGCATA ATTCAATATT 
1301 TCAGTTGTTT ATTTGACATA TATCACTTTA TTTAT6ATAT CTGATCCA6A 
1351 AGCTCTTTTQ GATTTGCTTT AGTTAAGGAA TOGTGCTTGC TACGCATTAA 
1401 TACCATAAGC AAACTGTACC TTTTGCTCAC AGAATATTGT TAATTTTGAC 
1451 TACTTCAGTA TGTCCGTTGT AGTAAAAACA AATCAACTTG GTGTATCTAT 
1501 TTTTTCCTTG CTTATACATA GCCAGGAGAT TGGGCATGTG GCATGTCAAT 
1551 AAATACTATC CTATACCATT TGATAGGACA CGCACTOTGT CTTATTTGGT 
1601 AGCTCTGTTT AC6TGATTCT OCAGAGCTTT GTTGGCCAT6 GOGACTCAAT 
1651 AAAT6TSATA AGAACTCAAC OGTTXSAAGCC TTC6CTCATC ATTTCTGCAA 
1701 GCAAGGTTAT GCGATAGTCT GTTCTTAGGT TCATGTACCT TTTTATTTTT 
1751 ATAATCTTTC TGAATTTIGA CACCATTTCA TATGGCATTA TCTAATAGGA 
1801 TGAATCT6TT AGGCTATG6A ATGTCCATAC AGGGATCTGT ATCTTGATAT 
1851 TTGCT6GAGC TGGAGGTCAT C6CAAT6AA6 TATTQA6TGT TGTAAGTAOT 
1901 GCCTGCTATT AT6ACATTGT GCCCTTCAAA AAAAACATIA TTATGACATT 
1951 ATTTTTAGAA CATTACTAGG TTAAGGTGCC TTTAATAT06 CGCACTCTTT 
2001 CAGCTCCTGA TATTACCATT TGTTATTCAG CGTTACATCA GAGATAAAAT 
2051 AAGGCTACCT AATGACTGCT ACTGCTTTTC TACTTTGATT ACATTAGTCA 
2101 TAAATGTACT GATGAATACA TTATTTTCTC TTAAGGACTT CCATCCTACT 
2151 GATATTGAAC GTTTTGCAAG TTGTGGCATG GACAACACTG TGAAAATCTG 
2201 GTCAATGAAA GGTTA6AAAG CTACTTCAAA GTTCCTTCAT ATTTGCATGT 
2251 TGCGT6TCAT TGAGTTCACC AATGTTGTCO CAGAATmG GCTATATGTT 
2301 6ACAAATCAT ATTCATGGAC TGACCTTCAT CAAAGTTCCA CAAAATATGG 
2351 CCAGTTTCCA GTATGTTTCA CAATGCCTAT ATCCAATTAT CCTGGCAAGG 
2401 TCCTGTTGGT GTCTAATCCT CATGCCATCA GACTGACCTG ' ITTC ' mTlt; 
2451 TTTCAGGTCT TC A TT G CT G C AGTACACTCT AACTATGrTG ATT6AACAA6 
2501 ATGGCTTGGT GACTTCATCC TATCAAAGGT GAAATTrCTG ATTCGTTTAA 
2551 ATGGATACAA ATTTCTGTAG CACGGTTQTC ACTCTTTTCT GGGTTTGACA 
2601 TGCCACTGTC TTGCSTTCATC TATTGCTGTA CCGTGCAAGT GTTCAGTTTT 
2651 TTCMTCTTT TTTCTCAGTG CTTAATGAGG GGAGATTCTA TTTGCAGAGT 
2701 GTTGTCAATG AAATTGTGCT TT6GGAACCG AAGACAAAAG AACAGAGTCC 
2751 TCGGGXGCTJi ATTOvtnTTJv JlCTTtCCCWG MOTCTXTTC CTWTJCrWCT 
2801 GCCATATATT TACOCACAGT TGTAAACTAT TTCCAGATCC TTAGATTTCA 
2851 AGGTACTGGC TGCCAATATT AAATATGTTC CACTGAAGTA ATATGATTTT 
2901 Cl' G TT G CCTC ATAGGGAAGC ATCGATATCC TTCAGAAGTA TCCTGTCCCA 
2951 GAATGTGACA TTTGGTTTAT CAAATTTTCA T6TGATTTTC ACTTCAATCA 
3001 GTTGGOGATA GGTAATATCT CTCATCAGGA TTGTTTCTG6 TAGAAGTTTT 
3051 ATTTAAGATT TTTTTTGCTC TGTAAAATTT CACACACGCA CACATGCACC 
3101 CCCACACACA CkCACKPGCA CGCACACCCC CAOOCACCTG CA0GD5CGO5 
3151 TACftCACAOV CCGCACACAT ATATATGACT TTTTTTCCCA CACAAATATT 
3201 TGCT6TGTGA GATATCAGCA AATAAATTCG TATGTTTQAT TATATTCAGA 
3251 GATATAGQAA AATTGAGTGC TCTAATACCC CATCCACTAC TTCAAACAGG 
3301 CAACCGTGAA GGCAAAATCT ACGTGTGGAA AAATACAGTC CAGCCCTCCT 
3351 GTCCTCATTG CTCG6XAGTT TTCACTGGAA GAGTTTCAGT TATTCTTGTC 
3401 TCCCACTTGT ATCCTCGCAT GCTTCTGGAT GCCAATGCTT CATCATTTTC 
3451 AGGCTGTATA ATCAGCAGTG TAAATCGCCG ATAAGACAAA CTGCAGTGTC 
3501 CTTCGATGGA AGCTACCTCA CTCTAATCCA TGCTCAATTT GGTGTACTGT 
3551 CTATTCTAGC ACTTGCTTTT TTCTTGGTTC TGCTTGAGAA ATTCTCGATT 
3601 OCATGTCATA TGCTOGTGCA rmVlTm ' TCTGTTTCCG TCGCGGATIG 
3651 GTAAAATGCG ACGAT60CTT CCTTATCTAG CACAATCCTT GQAGCTGGT6 
3701 AAGACG6CAC CATCTGGC GG TGGGATGAAG TG6ACCATCC QAGCTCCAGA 
3751 AACTGAAGAA 6TGTTGCCGC TCAATGCTGG ACTGATOGTT ACGCTOGGTT 
3801 GGGGTT6T6A TGGTIGAATC GGTTG6CGGA AAGTGCCACC TGGTGTTTTT 
3851 TTCTAGTCAA AATGGTTGAT GTTAACAGAA TATTGAATGC TTOGAATGTT 
3901 GAAAGTTGGG ATGCrTGT G C 7GGTACTCTQ CTC CGOGQAC GAGTGAACTT 

3951 AGTTTGTTGC aactttggga accgttgtca tctgtttgtt ctgcatttct 
4001 aaaaagagag caaatttcag gatacatgtt crrrmTTC actacaggaa 

4051 AACTAAGGTT GAGGTATTGC TTTGCAATTT ACTCTCTCTC TCTCTCTCTC 
4101 TTAAAAAAAC TGGATCTTGC TTCAACGATG CATTCCTTOG GTCATCGGTT 
4151 TTACTTTTGA AATCTTGATA GCTGGGCCTA AAGTTACCAA GCCCACTAGT 
4201 ATCAGAAGTA ATAATATGAT GGCTCCTCCC CTGCCTTACT GTCACGTGTA 
4251 AACTTTGGAA ACTAGCAGGA CTGTA6CATT TAGCGA6CTG GTTGTTTGGG 
4301 TTAGAGCTCA GCGTGSCAAC .TTATGOTACC GAGGTCAGTG TGAAGATCTA 
4351 TG6CACCAT6 GTTCAATCAC AGTTTTAGTC CCACCAAAAA TATAAAGGTG 
4401 AAGTTTCGAC AAAAAATGGC TAGAATAAAA AAAAACAGGT CCACATACTG 
4451 AGGAGAACAC ATGACAGATT CACCAAGGAT TTTGAATTGA AAGAG6CTAA 
4501 TGATTGACAG GATTTGATCT TCAATTCCAC CTCCCGTTGT CCTGCTTCTA 
4551 CTCTAAAGTT CAAGCGTGGC TCAGTTTGCC TATCTGTTAT AATTTCAAGA 
4601 AATCCTGATT TCTGTTAGCA GTTTACTAGG CTATTAGGAG GAGCTGOQAC 
4651 AAAAGAAAAA CQA6AATXGA CGAGGACAAA 1T06CAA7TA GTTGOOAAAT 
4701 TGGGGGCAa ATTTTCAATG CCCACAAAAT TCACTGCOCC TACHTNTCS06 
4751 GNGGAATGG6 GTCANKCCTC ANT6TCCCCT J6TTNC06GC3A CAAGTMTAAC 
4801 TAACACATTT CCNMATTNNT N 
Primers for PCR amplification of ZmFie genes across CCGG sites 
Gene  Biocode  Primer  Sequence
Fie1  66766    Fwd     AATTAACCCTCACTAAAGGGCGCCGCCACCATATAGAACCAC
Fie1  66765    Rev     GTMTACaACTCACTATKGQQC&TTGCAACTGGOQATQGC
Fie1  35573    Fwd     ATGAGATAAGGACTCATGCCTCGAAGCCA
Fie1  44446    Rev     CCCACTCACGTOataGTMSMM
Fie1  53924    Fwd     AATGCAASTCCCCaATAACGCMaCCGCAG
Fie1  53926    Rev     CAACCMCKCOGMCTACSATCOATerSAA
Fie1  53925    Fwd     AOGCSMATCTATCTCTaSQAACTGCASTC
Fie1  34971    Rev     ATCGGCGACTTGCATTCC
Fie2  38860    Fwd     OSCeACACTAGTAACGGTCTACACOl
Fie2  36675    Rev     CGCGTCCATGAAGTTGAACCCGATAG

Fwd - forward primer, Rev - reverse primer 
SEQUENCE LISTING 
<110> Pioneer Hi-Bred International, Inc. 



<120> Imprinting in Plants to Control Gene 
Expression 

<130> 14B7-PCT 

<150> US 60/363,861 
<151> 2002-03-13 

<160> 6 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 17 
<212> DNA 
<213> Zea mays 

<400> 1 

gatctagtgt gtggctg 

<210> 2 
<211> 24 
<212> DNA 
<213> Zea mays 

<400> 2 

cgtgaaggca aaatctacgt gtgg 

<210> 3 
<211> 29 
<212> DNA 
<213> Zea mays 

<400> 3 

cattacgtta caaatatgtg aaccaaacg 

<210> 4 
<211> 31 
<212> DNA 
<213> Zea mays 

<400> 4 

cagaacaaac agatgacaac ggttcccaaa g 

<210> 5 
<211> 13031 
<212> DNA 
<213> Zea mays 

<220> 

<221> unsure 

<222> (11384)...(11481) 

<223> n = A, T, C or G 



WO 03/078580 PCT/US03/07552 


<400> 5 

ccgatcattc gtttgttcga tcatttgatc gttcatcgtt cgttcatagt tcctattcat 60 
cgttcatcgt ttgttcatag tacttattca tcgttcatcg ttcgttcata gttcctattc 120 
atcgttcatc gttactattc atcgacacta ttcaccatcg ttactattca ttgttactat 180 
ttaccggctc tattcgtcat cgttactatt catcgttgct atttatggta gctttttcgt 240 
tgttactatt catcgatcat ccgatcgccc caaatttcaia ctactcatcc atcatgttgt 300 
ccagtccacc taagaccagc cagacccata ttccagtcat acgaactcct gtgattgtga 360 
ttttccttcc agtagggaac ctcccatctg gtcacccatc ctaggtttct ccaagttgag 420 
catgcttaac tttgagattc ctttgaacca ggcttccaaa ctcagattcc aataattctt 480 
gtttctaaat tcttatcaaa ctattcccta tccaaccatg tcatccctta agcctggtcc 540 
atattccaga aaactcccaa aatactcttg tcccatattc tgcatataac tctcctgttc 600 
atactaagtc agacgattca ttcgtcacta ttctcaccaa cagtgaactt cactgtgcta 660 
caccacatac actcagctat aaatacaccc agctaccctc tccctctcca cacacactca 720 
acaccctcag ccaaggcaaa cacctcaccc actcagttac tccgctctac cggctacacg 780 
catagtgtcg cttcgcctcc agtccaccct cctggtaagc acctccgctc caccaccagt 840 
aatatcacaa caccacatga cacagattct actcaagact ctacccatcc atatatcgct 900 
attctgacca ctatactaaa tatttgttgg tatacttgct ggtttgtatg tttgcttgtt 960 
catgttgcat agttatcgga gcgttcgtgc catcacgtgg aggccagatc tgcaagtcta 1020 
cgccaggcgg tggagccaga agccagttcc gcgagctctc cttccccctt cactggataa 1080 
gcacagcaag ctcactggat ccctttgatg cataaattac ctatgatttt tcaaccacaa 1140 
ccctcagcct gttattttat gcataatatg attttgagac aagttattat ggccacccag 1200 
ccgcttgtcg caatcaatcc ttgatatatt tgttacaaat gatttgagaa aaggtgtgag 1260 
ttttcaaaag aaaatgcttt tcaaaatgtg tatgatgaag ggttttcacc cttatcacct 1320 
tttaataggg atgatcaagg actccctggt ttaggggagg gcctaaggtg atggctcagc 1380 
tggtttaggt gtgagcagaa ggattgtccc ctcacataag gaccgatttg tcatccgtca 1440 
ctacctgtac tcatgataag tacaaccact cgagactgta tgggcaatca ctcaatctga 1500 
actcgtacgg tccaacccta gggttatgaa ggctggggag caccgggagg ataaggaggg 1560 
agaatgtttt gtccggtttg gacatggcgg tggcctgact ccttccggta taaccgttaa 1620 
ggtaaggacg tgcgaggaaa gaaagagatc cggcattcgg gcctcacgac ggtgagatcg 1680 
cagaaaccag actagtgggt aaagtgtacc cctctgcgca gagtttgaaa acctattcga 1740 
atagtctgtg tccacaggaa tggacgagtc tggtgtggta tgacaattag tgttttgttt 1800 
tcaaaaaaga atgtgcgttt gagaaaagtg gtttttaaaa ggtccggcgg ttgagccgtg 1860 
agctatggtg gacgggaagt ccagtagctg tttttgaaaa cgaaaaccag tgggaaactg 1920 
ctgagatacc tggatggttt agtccagggg attttgttct aatattgaaa aaaaattctt 1980 
gctcctttgg gagaggatgc gctttgcaaa atacaaaatg ttttacaaaa taaccctgca 2040 
taaaatattg ttgtttctgc aaaatatcct gagctccaca tattccatgc attatatctg 2100 
atttccccat tccgcgggtg atggtgggct gctgagtacg tttgtactca cccttgctta 2160 
tttgttgttt ttcaaaaaaa ggagatcggg taagagttac gactgttccc aaccttgcct 2220 
gtggttgttg gaccgctgat ttgcttcgct gcgtatatcg ggctgcttca tccccactct 2280 
gatgatatgt cccaagttgt ggaccaactc ttaaagttga tcgccacctt tataggtttg 2340 
tctcgtttaa gcagatctgg aatcatttga tgtataaatg tgtttactag cctcctggga 2400 
ctagtaattg tatcacattt gagtcctaga ggatcgggac gcttcaatga tcaatgggtg 2460 
gatcacaata gtcggttata atggctatat caacagttat aatcacatta aatgtgtcat 2520 
cagatgttag ataaagtctg tcgtggatga tctgtttgtg cttctcgacg gtccatgagt 2580 
gacgctaaaa ttcattttac caaacctagc accttcgagt tggtctgatc ttgaatagtc 2640 
agacggttca cgactgaggt tgaacgatcc acgcaaggtg ttggacgata ctttcttttt 2700 
ctttggatgc tccgtagtag atgtgtcggt tttgacatag ttcctgtccg aactccatac 2760 
agtccatagt agatgtgtcg gttttggtac tctagacggc ccgagtcagg ggtctggaca 2820 
gtcctggact tgctgagttg aggtttgatc tttctttagt tatttcttac atacctatgt 2880 
tcatacactt agcaaactag ttagcttcac caaaacaagt gtggaaaaag gtttttaggc 2940 
caatttccct ttcaccttta taactaccta gttacaaagt agagtttgat agtccctaag 3000 
tatgtcaatt cacatcttga gtacatgcga caatctcatg tctaaggata catggtacag 3060 
gttgcaagaa gaaaattgtc acaatatctc atgttgggtc agtacagact catgtcatac 3120 
atgcacccat attattagtt ttacatctcc atgtccatga cttacgaaac atagtcatca 3180 
actaatacat atgatagtca ttgactctaa ctagggacat cttctagaac aaccatacaa 3240 
gaaaagagtc tcacaaacaa ttcacataat tgctaatcaa tacaaggtgt ccttcacaga 3300 
tattcaatta aacaatatat catggatgca acawaatatg ctcatctcta tgattatctc 3360 
tagggcatat ttctaacaca atgacatgtc taagtgtagt atgtcaaaac atggatagta 3420 
atatagatgg taagaggtca tttttattaa tataattaac aaagatagat agggtgacca 3480 
attttgtaaa agcaccattc atagactttt agtgggaggt ggatgctcta cccgcctccg 3540 
taaagccaaa gtggttgcat gcaaattgyt aggatatagt aatgcaagga accaagctaa 3600 
ggcatgtaag tgaaacccaa acaagaagtt aagaagcttc caaaatgaac aaagtacaag 3660 
aatgaagcta aaagagaaac tttcagcctt ctccaatctc cagcaagatc ccttcgatag 3720 
atggtatcta attttttcct actatgaaaa cctatatcac ctagtagaat agaggacaaa 3780 
gcttacgcct actatatata tccaatatgt atagttagat actaagttct tttttctctt 3840 
ctcttcattc acttttcaac taggtttgga attaagtttt tggattggca tagacaatgg 3900 
catggttgta taggtgttct taaccatcac agttatgagt ttgacttgtt ttttatattc 3960 
aagttacaag gtcattttgt gctagccaca gcctagcaat cgaggggcta cacatgtgga 4020 
ttaaggacaa ggcccaaccc atgtacgatc caaggacacc cttgtaattt ttatactcat 4080 
caaggattag ggggaaataa ctcccttcta tataaaggtc tttccacttt gcttctcact 4140 
ctcccttatt aggttaaaca caaaatgtgc atcgccgccg ccaccatata gaaccactta 4200 
tcacgaaccg ccgccatcac atccactgcc tcaactagtg ttaccaccta tggttcattg 4260 
ttgtgtctgc ttcttgtagc actgttggtc tacaaacatt catatttctc tcaacatctg 4320 
gcacaggtaa gcccataagc cctaacccta gatctccata tttagttatt tcagttcttg 4380 
atgagcaaat atgaaactaa attagtttgc taataagaaa tttaactact tttcctcttg 4440 
aagacctcct atccctatat gaacccacat ccaaaacccc tctagcaaag tgtggctagc 4500 
tttcccatgc catgaacctt caacaatgat agtatcagta atgcacttcc ataaaagggt 4560 
tcatatttaa ttttagtttt tctttttggt gttttaatta agctttgaga cttgatttga 4620 
agtattaaat aaacccttca aatttctttc taactttgat aatacactat tcaatgacaa 4680 
tgcacttcct taaatcccta tacttcacag catgccgcct tccaaagcac gccgaaagag 4740 
gtcacttcgt gatatcactg ccaccgttgc cactgggcct gttgccaact cgaaacctgg 4800 
ctcatcatcg acgaacgagg ggaagcaaca tgacaagaaa aaggagggtc cacaggaacc 4860 
ggacatccca ccattaccgc cggtggtggt gaatatagtc ccacgacaag gattaggatg 4920 
tgaagtagtg gaagggctac tcgtgcctag tcggaagcga gagtacaagc ccaatagcaa 4980 
gtatactgtg ggaaatcacc cgatctatgc catcgggttc aatttcattg acatgcgcta 5040 
ctatgatgtc tttgccatcg ccagttgcaa tagtgtaagc aaccgacttc tccctacctc 5100 
ttgtttgcta tccttttatc ctattgaggt ttggggagtt ctatatggtg aacgaaaatg 5160 
gaagttatga ttttggtggg attggatctt ggtttataac tagaaaagga tttgagtaca 5220 
ggttatgatg tgtggcttta tggtagggaa acttaatatc ttttcctatt ttgttttttg 5280 
gcatcacgag taatggtttg ggaaataaaa gggaaaatga tttaaaatta tttctcaata 5340 
gagcatgccc ttttacatag ggacatttta gtcattttac acacacttta gtcattttac 5400 
acaccgtaat tatgtcacaa tcaaagaatc attccttggt tcaattgaat gagatgattc 5460 
aactagttca catctctata cctaacaata tagtttttca taactaaagc tttgagactt 5520 
gatttgaagt attaaataaa cccttcaaat ttctttctaa ctttgataat acactattca 5580 
atgacaatgc acttccttaa atccctatac ttcacagcat gccgccttcc aaagcacgcc 5640 
gaaagaggtc acttcgtgat atcactgcca ccgttgccac tgggcctgtt gccaactcga 5700 
aacctggctc atcatcgacg aacgagggga agcaacatga caagaaaaag gagggtccac 5760 
aggaaccgga catcccacca ttaccgccgg tggtggtgaa tatagtccca cgacaaggat 5820 
taggatgtga agtagtggaa gggctactcg tgcctagtcg gaagcgagag tacaagccca 5880 
atagcaagta tactgtggga aatcacccga tctatgccat cgggttcaat ttcattgaca 5940 
tgcgctacta tgatgtcttt gccatcgcca gttgcaatag tgtaagcaac cgacttctcc 6000 
ctacctcttg tttgctatcc atttatccta ttgaggtttg gggagttcta tatggtgaac 6060 
gaaaatggaa gttatgattt tggtgggatt ggatcttggt ttataactag aaaaggattt 6120 
gagtacaggt tatgatgtgt ggctttatgg tagggaaact taatatcttt tcctattttg 6180 
ttttttggca tcacgagtaa tggtttggga aataaaaggg aaaatgattt aaaattattt 6240 
ctcaatagag catgcccttt tacataggga cattttagtc attttacaca cactttagtc 6300 
attttacaca ccgtaattat gtcacaatca aagaatcatt ccttggttca attgaatgag 6360 
atgattcaac tagttcacat ctctatacct aacaatatag tttttcataa ctagaattct 6420 
taaaaagaat taatatgaac ctaaatatta tttcactttc ttgcccctta taatataata 6480 
catttgtcac tcccattttg gcaagggtgg tgggtatttt gggggatgga atgttactat 6540 
ttttaatttg attagaagct ataagctttg gctatatttt tattaggaat ttgatgttca 6600 
ttttcaatat attgtgatct attttcttaa aatgtgaatt tgttgtgtat tttgattagt 6660 
tcgatgaaga gtgtttataa gatatgattt ttaaattctc ttacgacgaa acaatattat 6720 
gttactttca tctattcatc ttgaggaatc acctacctca cttcttgatc ttgcaggtga 6780 
taatttaccg atgccttgag aatggtggtt ttggtcttct acaaaattat gttgatgagg 6840 
atgtgagaaa gacaatgcct ggtgcatgtg gttgttaatg ttaatttgat aatatgcttt 6900 
tatctaatgt ctgtggtgcc tatttatctc agaaggatga gtcattctac actctaagct 6960 
ggaccatcga tcaagttgat agctcaccgc tgttggtggc cgctggaagc aatcggatca 7020 
ttcgggtcat caattgtgct accgaaaagt tagataaggt ccctgcccct gtgcttactc 7080 
tatgtttgta tggaaaagtt gattgaacgt tgatgttcac atatcaatat ttcagtagtt 7140 
tagttgaaat acaatttatt tatgctctct attcttgaac atcagttgac tttgctttga 7200 
ttaagcaaCg gtcttgcCca tacaatattc taggagtcga atattcaata tgcctgttac 7260 
atgatagcaa atacatagtg aactaggaca tgtactaaat atttaatttc cctttatgac 7320 
attctctaga gcttagttgg ccatggtggt tcaatacatg agataaggac tcatgcctcg 7380 
aagccatcac tcatcatttc tgccagcaag gttagtaata aatttgtcgt gtgtcgattt 7440 
ttttacactt tttaacatga cattattcta taggatgaat ctattaggct atggaatgtc 7500 
catactggga tttgcatctt agtctttgca ggggctggag gccatcgaca tgatgtgttg 7560 
agtgttgtaa gtatcgattg catcttgtct agacattgtt ttaaatatca cttgccccga 7620 
agataacact cattagaatt ctaatgttac catttgttat tgagcatgcc aaatttcaat 7680 
tttaacatca tagataaaat aagaccccac aattactttt actgtttatc tacttccatt 7740 
acattaggca taaagttact gataaaaaag acaatctttt atctgaagga cttccaccct 7800 
accgaggttg ggatttttgc aagttgtggc atggacaata ctgtgaagat ttggtcaatg 7860 
aaaggtttgg gaactacttt aaactagctt catgtttaca ttttgtgttg tatgttgcat 7920 
atcatdgaca aatattgcca atgttgtcac agaattttgg atatatgttg aaaaatcata 7980 
ttcatggact ggccatccat caaagtttcc aacgaggaat atccagtttc cggtatgtta 8040 
agtagctata atcacctgag ctcctttctt tttttgcaaa ctattgttgg tgttcagttt 8100 
tcatgccatt caagcataca tgtttctttt cttttaggtc ttgactgctg cagtacactc 8160 
tgactatgtt gattgtacca agatggcttg gtgacttcat cctatcaaaa ggtaaattct 8220 
tcatttgtta aatggctata cattttttta taaaggaaat tttttattaa tttcaagcac 8280 
tttagattga aataatacaa aatcttaaaa aacatttttg gcctccattt aaacaagcac 8340 
aaatccaaca aaaatgagta aaccaaccca ttctagtgaa tattaatgca taaactagat 8400 
tgctacccat atgtctagaa aaagtagcct tgaccgcgta tcttaattgt caccatgccg 8460 
ccacaaccaa accgtgcaaa tatggttttt ggagaatgga ccaagtaaga aaccaatcaa 8520 
taattgagta tatagcatgc acaggagaaa tagatctctt attttcaaga acaatggtat 8580 
tttttattaa ccataggacc aacaagtagc gactacccat agcaaaacta atggcttcag 8640 
attattactg gttgttgaag tgtatacgtg gtttgcctac tttctcccaa tagtttaagc 8700 
ttttggattg aatcgattag tgcgttcact cttacatggt atcaaagtta gcaattttgg 8760 
gtttgaatcc taacggaagc tttatttgtg acttcacctc ttgttttcca tttcctttct 8820 
acctgcacgt gagtgggggt gttgaagtgt ataagtggat tgcctacctt atcaaccttt 8880 
tggattaaac tggttattgg ttagtgtgtt cactcctaca cctaagtatg aggtttagtt 8940 
atccagtagc caattagatt atgcacagtg gacacttcac atgtgcaact agcactcaaa 9000 
acataagtct ttaattgtct catcttatga caaaacaaca tatttcacta ccattctata 9060 
acatcttgat ttgtacatca gtcttgttaa tgctaaatag tgagatttga tcgtcaattg 9120 
gccagttgga tgtaaattcc agtgaaatac atcttgacct tgggttaaat ggacattagc 9180 
aatgtgtggg aacaaattgt tggtttgggt acaccaaact gttggttttt aattagtaga 9240 
ttagtttgta acacatttcc ttttatcagt gttagtattg gtttattatg catagggaag 9300 
gatctgatat gtgataatta acatggattt gcagagtgta aagaatgcag ttttgctttg 9360 
ggaaccaaaa ccagacaagc gtaggcctgg ggaggtgaca cgctttacct tctcgtcccg 9420 
aattctgcac ctatttttat attactatca tactcatcta cagtttaaaa cttgtcccgc 9480 
aatcttttca gtttctgagc actaaattta tacctctgaa tcagtatagt cgttttctct 9540 
ttgttcgtat aggggagtgt tgatgttctt cagaagtacc cggtgccaaa gtgttcatta 9600 
tggtttatga aattttcatg tgatttttac tccaaccaga tggcaatagg taatgccttt 9660 
aattttgtga agactgtttt ggcactaaag ctttacgtac gtaatattag ttttatatct 9720 
tgtacattga tggaaaatag attgctcaat atctatatat atgactatat cttgggttag 9780 
attctaagga acaaactctc ccagagtacg gttctgaata acaaccatct gctgctgctg 9840 
cttaatgcga acaggcaaca ataaaggcga gatctatgtc tgggaagtgc agtccagccc 9900 
gcccgtctta attgaccggt aaatttccag ttcttctcct cctcgcatcg gttcctgcat 9960 
gggtagctag ctagtaactc cgacgcttct gctggatgca aacacttgtg cattttcagg 10020 
ctgtgcaacc aggaatgcaa gtcgccgata aggcagaccg cagtgtcatt cgacggaagg 10080 
cacgtacgca ctacgactct cactatctgc tcatgcatgc attcaccgca cgtacgtgtg 10140 
atgtgctcgc tcgcttcctc cttttgtgat ggtgtctctc tcacttgccc agcacgatct 10200 
tggagccgcc gacgacggcg gatctggcgc ggtgggacga agtggaccct gctgcttcca 10260 
gctccaaacc tgatcaagct gctgcgcccg ccgccggtgc gggtgccgac gccgacgccg 10320 
acgcctgagc gagaggaccg tcgtcgcccg ccggttcaca tcgatcgtac tccgtgctgg 10380 
ctgattacct ttacccattg ggatgttttg gttcagagtc gccagatcta gtgtgtggct 10440 
gaacgttgaa tgttaggatg ctgctgcttg ttatgctctg agtcttgagt tctctttgtt 10500 
aatttgcacc gtggatgaga tgaataactt gacgttgcaa ctttgcatcc catatatgcc 10560 
gtaaatctgc cgtctgttgt ttgttctgcg ttgtctagaa ttagtggaga tgtgctggat 10620 
acaatgtatg ctagtctatt aaaccgtgct ccactctgag ataatcgacc aacttgtctt 10680 
attattgaaa gaactgtgga aaaaaccaaa aaaagtcgtt gtggttttgt ttattatcaa 10740 
atatatttta cataagactt aaaagttttc attttttcat gaattttttg aataaaccga 10800 
gtagtcaaag ctagggtcaa aaaggcaaac atattatatt ttaaaatgga gagagagtac 10860 
attgttttaa gacgaattgt ttaatacaac tcgagaatat tctgatacat taatcctatg 10920 
atattaccat aaaaaacatt aatcctatga tagagtgtat aattacaaat gcacaaaggt 10980 
tcttttcatg tgaaatcgta ttatagatag gggtcatagc gcgcccttgt ccctacaact 11040 
tacgatgttc atgagttagg ttagaaaaag gttagagcaa gtatactaaa gtgacatatg 11100 
caggctacaa ggaatgccac atcagatttt tggtgacgtt gaaggaagaa aaatagaggg 11160 
agaaaaaagc gaaccaattg cgaaggtgcc ttcttccaag ggcacggtcc atggagtgtg 11220 
gtagccgaca tcaaggtaga ggattatggt aaagttattt gagcaagtgt ctgacaacta 11280 
gcatgaaggc ttaggatttt ctaaatgcat ctttgagcgc tattgatgta gatgttaatg 11340 
atttttaggg ctgatgacca aaccaaagat gaacatggga acgnaaggaa ggttactgaa 11400 
agtgtatagg cccctagttt agtcttcagt gactaatgat aatatatatt attgtgacta 11460 
acaagtgttt tatagaaaca nggaaagtta gatcacaata atagatatga tcaggattat 11520 
tatgtggtac ccatccctta ttgatgaaaa tcaatggttg gttctcatag gataatcgaa 11580 
aaggttaagg atcaactgta aatggagttg ttggacactt agagtagtga tttgaccttt 11640 
tttctttggt agtactataa acggacatga aatgcgtagc tttacctaaa caagtctagt 11700 
taagtatgat gatgcacact tgtgaatact agtgctaggt aaacccatga gatctcatgt 11760 
gaagttcgaa acaaaaccta attcgaaaag tgattaaaac atgtgactta acaatgttgt 11820 
agtagcattg gtcgagtttg atgggcacct gatatgggtc actagacatg agtgtgccct 11880 
gttgtgtttg agtgaagcac tagcatatca ggtgtgcaac agatatggtg cacccaggca 11940 
ggacacccaa agagcttgca aaattagcct aaaacactta gtgctcacca gacatatcta 12000 
gtgtactact agttattctc gttatatatg aaccctatta gttattcttg aattgcttcg 12060 
atcttttaca aaggaagtag tttttccttc atctccataa actgtggttt tccaaaggca 12120 
ttaataataa gatttagtat attaaattca aagttgaggt actttattat cgtgaaacca 12180 
acattaatac tatagactta actaaggagt ctattggtgc ttccttctca tgtattttct 12240 
tcttgaagtg ttccttcatc ttggtgctaa cgacgacatt caacaatgtg tgctcttact 12300 
tgattggttt gtatatatgg tggtgttcct ttacttagtg gcaacatacc ttatcgataa 12360 
ctaaccctta gtgaaagaaa tgaaaatgta catcccactg ggaaatcact cataccccta 12420 
agagctaact taatggaaca tcactcatag ccctaagggc tagttggaag tactttctca 12480 
tttcctgtat aagggctagt tcatgattca acttcttctc catttcttgg tgaactatct 12540 
tagcacgatt cctataaaaa catatacaac taaacaaagg gtggtggtac tgaacacagt 12600 
ggacccaagc actcggaaat gggaaggaca agttgcatgg aaaaaacgac aggctgggaa 12660 
ctattgtgtc ttgtcaagcg tgttcgtcca gctataggac atgggtattt atagggcaac 12720 
tagaggttgg tatcctaaaa tatgtccaga cccctagtta tcaactacgt tcctagataa 12780 
tactgtacaa caaggtaatt atagaatagt aagtttgtta ttctaactcc accccgacag 12840 
gtgggtccgt tgtcgcccgg ttgagagtgg gccctgctcg gccaggtcat tggcattgtc 12900 
cgtgcagacg tgttcccaat atcgaggcaa tgaagttgtt tgacacttct tcgggagtcg 12960 
gcgtgaggcc ttcgcttgct agcgcgaact tgcccacgag cgtcctcacc atgggccccg 13020 
ctgacaagct t 13031 



<210> 6 
<211> 11232 
<212> DNA 
<213> Zea mays 

<220> 

<221> unsure 

<222> (11155) ... (11232) 

<223> n = A, T, C, or G 

<400> 6 

tttttcacac cgttactgtc atctaacaga agcaggtaca aacttgtttt tcgttttcaa 60 
gtcgaatttt gaggggcaaa ccatagttgc acttccatcg agggacaaaa acacaattgc 120 
cccttaactt atatagttaa atatagttaa cgagcttgct actgagacta acaagtcaaa 180 
actattggct tgaccttata ttagttttgt cttacacttt acaatcgttg atggctgctc 240 
tagatcttat aaacttaaga atattatgac tttatcactt tatttgtaat ggatgtatgg 300 
atactcattg atgcattatt tatggtataa actatagacc atgaatgtat ggtgtaatgc 360 
tatagtatat tgttagactt gtgtacatat atattattta tacttaactc acaaacttaa 420 
tgagtcagct cgaacttata aacgacctga gtcgacctgg ccttatggct tgttaagata 480 
acaagtcaaa ccaagccgaa ctgactcgtt atccaaatct acacttacat aaacaaaaca 540 
tgatttcaaa ttaagattgg tacaaaagtg ttttgtttta ttcaattaaa ccctacactg 600 
tactctttat gtcaacaata gttgatgcta cgacaaagca atgaacattt tatggagtag 660 
ttaattttat tgtcctaatg tcaattacta ttgttagcca aggaatggag taagccaata 720 
aagagtacat atctacgagg aaatttagat atgtgcgtaa cttttttaat cgagatacaa 780 
aatgtgcaaa ataagggtcc atgtaacata catatatttc ttgtttttat ggtaaaagag 840 
tgtataaact ataaaggttg ttgcttagaa gcgggattta ataacatcgg ttttatatta 900 
accttaagtc cctatgcaat acctgtattt 
cacacattta agcacacata ctcacttgct 
ctagcacctt caaaagacaa aatagataaa 
aacgtccggt ctaaatcttg acaaaatatt 
ttgaacgtaa gtggttagag gaacctaacc 
ttagcttgct agtttttcta tacacaaaca 
tctaacttta atgatttctt tgagaaatcc 
agcaagagat ttgtacatgt ataaatacta 
tagcaaatat aaatttgaaa cataataaag 
agcgtaccat taaagatata cataattatt 
ccacattccc ctatctctac aaactctctc 
atctctcctt ctctcttttc aaatgttcta 
ttcctaccaa ccttgttgat gcttgtctca 
atccctattt caaaggcatt aatcatcaaa 
gatggagtct gatcgtgaga aacaacagtc 
caagatgcat gattctgatg gcaataaaaa 
tggtaaggaa cacaaatcca atataaagaa 
tctccttgaa aatcttggct tcaaactcaa 
gtctagagat tttgtgctta atatatgcat 
aataaaaaaa tcaatatgat caggaattaa 
ttgagttcca tggtcatacc atgatggtta 
taacttatac tcaaaatgtt gattggatgg 
ttgtgtaaca accatttgta ttggtttgcg 
taaataatta tttttcaaat ccaaagttgt 
ctacaagaat ttaaaatgaa ttgttaactt 
tttctcccat tatgatatat atattctcaa 
ttacatttgt tgggttgagt gctatatgtt 
ttgtagatgt ttgtgtttac ccaataagaa 
atagagttag tcttgacatg ttatattctt 
acattccttc catttaaacc taactccatc 
cttctttcta cactaacact aatgcttatg 
agtcaattac tgttacgttg ctagaaccaa 
ttcctccttc ttgggactct tcaaccatcc 
ctaccatgtc atcctactcc atatccaatg 
gcaccaaact aaccacaaaa taataatccc 
aagggagggg caattgtaaa tggtagtacc 
atgtctacat agttctagtc ctatgaagca 
ttagaggaag gggtggatcc ttactttcat 
ttatccatcg actgaaagtt cctcattgct 
tgacttttgt atgtactatt atctttgaag 
gatccattaa tcctccaaca cacccttaag 
aacatatttt aggtgacatg ggtatatgtg 
caatgatttt tagtcagaaa atggttgata 
ggcacataga gcaagttttt agaccatgaa 
atggtgtaat gctatagtat attaattatt 
taactcacaa acttaataag tcagctcgaa 
atggctcgtt aagctaataa gtcaaaccaa 
ttatgtaaac aaaacatgat ttcaaattaa 
attaaacgct acactatact ccttatgtca 
acattttatg gattagttaa ttttattatc 
atggagtaag ccaataaaga gtacatatct 
atttttttaa tcgagataca gaatgtgcaa 
cttgttttta tggtaaagga gtgtataaac 
taataacatc aatttcatat taaccttaag 
agtacctggt acaagcataa atacacacat 
cacacgtaaa ccctactcct actagcacct 
caaagtctat ttatggtata aactatatac 
ttgttagact tgtgtacata tatattattt 
tcgaacttat aaacgacccg agtcgaactg 
accaagccga gctgactcat tatccaaatc 
attaagattg gtacaaaagt gttctatttt 
tgtcaacatt agttgatgct acgacaaagc 



ttttctaagt acatggtaca aacacaaata 960 
atgagcacac acacgtaaac cctactccta 1020 
tcttgttgac aaagtctatt gaaaaatatc 1080 
agcacttgtg ccaagttaag aagtgagcac 1140 
aagttagtta tgttcaattt ttcatgcaag 1200 
ttatattagc ttataccatt gttgggaaat 1260 
ataagagcga taaagaggag agagagagag 1320 
tccattttct atttaagaat ctagacaaac 1380 
atgggcacct ggcatctcct ggatattaaa 144 0 
cacctcttct aggtataaat taccctacta 1500 
tcattgactc atcaagagag tgccacctct 1560 
caattatcaa ccatcataca acattcacct 1620 
actttctctt tacctagatc actcatatat 1680 
aacctataga aaaatcccat tatcaaccat 1740 
tcatggcaag aaacaaggtg accatggtag 1800 
tgtgtcagat gaaaagagtc aagagtctgg 1860 
acatgaatca cgtagaaaga ggtaagacat 1920 
gttaaattta tgtacacatg tttatataga 1980 
gcacatgagt tcaaataatt tcataataaa 2040 
accatgaaat ttttagagac atcatctaga 2100 
tgtcatttct ttccaatata aaaaattcct 2160 
aactttttct atagaattcc ttgccacatg 2220 
tctagtccac ttttgtgtgt tgctattatg 2280 
tcctccacat atctagaata tattctaatt 2340 
aagaatgcat tgttcaataC atttatgcat 2400 
tatttggcac, ataataactt ggaacattcc 2460 
tggattcatt aattatttac attgatattt 2520 
aaggccatta agaaaataaa atgttattag 2580 
ttaataattg gattttgtgg tatttccaac 2640 
tctcttatct tcctctatca tataccttat 2700 
tcactcctaa ccttgatgca acctaccaat 2760 
agattggtcc attggtgcac aatccattag 2820 
taactcccca aatgatttca aaagttttcc 2880 
tctactggtg ctagattcta tctactgtta 2940 
tacaaatata ggtggaggtg atgtaaaatt 3000 
atagatatca aaccttctca acttagagct 3060 
tcaaccattt tcttactaaa ctaaatattt 3120 
ctccatgagc ttccacccct tcctatgagc 3180 
ggagcttacc cgttattatc ccatgtcatc 3240 
tcgtaggcat gtggtaaatt cctaccttaa 3300 
acccaaacca taacgcctaa atccaatttc 3360 
atattagtta cttaatatag caagctctat 3420 
tgtttttagt ggttgtacta taattgaaga 3480 
tatatggtgt aaactataga ccatgaatgt 3540 
agacttatgg acatatatat tatttatact 3600 
cttataaacc acctgagtcg aactggcctt 3660 
gtcgagctga ttcattatcc aaatctacac 3720 
gattggtaca aaagtgttct gCtttattca 3780 
acaatagttg atgctacgac aaagcaatga 3840 
ctaatgacaa ttactattgt cagccaagga 3900 
atgaggaaat ttagatatgc gtgcaacttt 3960 
aataagggtc catgtaacat acatatattt 4020 
tataaaggtt gttgcttaga agcgggattt 4080 
cccctatcca atacatgtat tttatttcta 4140 
ttaagcacac atactcactt gttatgagca 4200 
tcaaaagaca aaacagatag atcttgttga 4260 
catgaatgta tggtgtaatg ctatagtata 4320 
atacttaact cacaaactta ataagtcagc 4380 
gccttatggc tcgttaagat aacaagtcaa 4440 
tacacttata taaacaaaac atgatttcaa 4500 
attcaattaa accctacact atacacctta 4560 
aatgaacatt ttatggatta gttgatgcta 4620 
caacaaagta tattgttaga cttgctagat tctatctact gttagcacca aactaaccac 4680
aaaataacaa tccctataac tataggtgga ggtgatgtaa aattaaggga ggggcaattg 4740 
tatatggtag taccatagat atcaaacctt ctcaacttag agctatgtct acatagttct 4800 
agtcctatga agcatcaacc attttcttat actaaactaa atatttttag aggaaagggg 4860 
tggatcctta ctttcatctc catgagcttc caccccttcc tatgagctta tccatcggtt 4920 
gaaagtttct cattgctaga gcttactcgt tattatccca tgccatctga cttttgtata 4980 
tgtactatta tctttgaagt cgtaggcatg tgtaaattcc cacctcaaga gtcaagatcc 5040 
attaatcctc caacacaccc ttaagaccca aaccataaca cctaaatcca atttcaacat 5100 
attttaggtg acatgggtat atgtgatatt agttacttaa tctagcaagc tctattaatg 5160 
atttttagtc agaaaatggt taatatgttt ttagtggttg tactataatt gaagaggcac 5220 
atagagcaag tttttagtcg ttgtattcta aacaatgatt gatgtgtata aatttaataa 5280
attcattgtt gcatcttgtg tttcatacat ttgaaatgct ttgtgcctaa tctttatgga 5340
tgaagaagta '^atccttcta aacttttcct tccctgcaat ctttttaaac acactctaaa 5400 
ccccaaatat ctaatcctaa cctctaaacc tgatttaaat tttctaatct agtccatttg 5460
tagtgctttt atatttagtc catttgcctt atgtgcctct tgtgtataaa tagcgtagag 5520
ttctgtataa tagtcaacaa gttttgcctt ttgttgtcgg atccattttc aatccttttg 5580 
tctagttcac ctattgttgt tgtgaaaaaa atgtcacaca ttttttactt ccccctatac 5640 
cacatactcc atcacggact aatgatcttc aaggtatgta tgctcagttt aaatccatgt 5700 
ctccacatac tccatcttaa gttcaagtct ctactttaag gtatgtaatt ttaaaacttt 5760 
gacgtattgt aattctataa ggagcaaatc tgaaaattaa ataaggaaaa actggtaaag 5820 
gcatgtttgg aaatcggaac gcagacattt tgttgttcct atgtttttct ttaaataaac 5880 
tcattcgtgt aaaatttctt caaaattcct ctccttcgaa cagatccttt tgcccccgga 5940 
cccctttcct acgcttgccc aaacccacaa aaccctcgcc gtcgcgccgc gcgattgcct 6000 
ctccggccgc cgcgagcccg cgacactagt aacggtctac accaccagaa tgactgaaga 6060 
attgaattcc agcaaattca agcttttgtt ttagccaaga tttgagattc gatttgaagt 6120 
gtggaagtcc ttccaatttg ccaatcctat atttgatctc tgctgtgctg cgttaaatcc 6180 
ctaaacttca cagcgcggcg ccggcccagc cacgccggaa gaggtcgccg cgtgaggtca 6240 
gtgtccccgt tgctgccgcc tctaacccga agcctaggcc gctgccggtg cataacaagg 6300 
agaatcaggc ggaggggaaa gtagcagagg agggggcagc aactgaggag ggggagaagt 6360
accgggcgga accggaaatc ttgccgctgc cgccggccat ggcgaagctg gaagcttttg 6420
ttttagccaa gatttgagat tcgatttgaa gtgtggaagt ccttccaatt tgccaatcct 6480 
atatttgatc tctgctgtgc tgcgttaaat ccctaaactt cacagcgcgg cgccggccca 6540
gccacgccgg aagaggtcgc cgcgtgaggt cagtgtcccc gttgctgccg cctctaaccc 6600 
gaagcctagg ccgctgccgg tgcataacaa ggagaatcag gcggagggga aagtagcaga 6660 
ggagggggca gcaactgagg agggggagaa gtaccgggcg gaaccggaaa tcttgccgct 6720 
gccgccggcc atggcgaagc tgggcccggg gcaggggctc gggtgcgagg cggcggaggg 6780 
gtcgctcgtg cccagccgga agcgggagta ccaagccctg cggcaagcac actgagggga 6840
agcgcccgct atatgctatc gggttcaact tcatggacgc gcgctactac gacgtcttcg 6900 
ccaccgtcgg cggcaaccgc gtaagccatc gactgctctc tcctgtcgtc ctttttttgt 6960 
ttctactgag gtttggggag ttcttgttga ttaatggcaa ggtaaaacta cgttgttttt 7020 
ttttgtgatt ttggtggtcg gttttaggaa gcggtcgctt ttgattcaaa tttgatctaa 7080 
agctgaggca ttcggttgtt tttattgggg acttgaggtc tgtaatgttc cgactattgt 7140
gatttgtttt gccgaaacat ggagtttgct agttcatttg atgaaaagct gcaacctttg 7200 
acaaagaatt tgtatcactt gggaaagtat agtgaggtgt ggggaatcag atagtaccaa 7260
tattactttg actatgatta taagataatc ttttaatgtc ctttgtaacg accatgctgc 7320 
ttttcgctta tcttgcctat tgatcttgca ggtgacaact taccgctgcc ttgagaatgg 7380 
tagtttcgct cttctacaag cttacgttga tgaggatgta agaaagacaa tgctcaatga 7440 
caatgctttt gcttgctgat ttaatattga taatattctt tctctaattc ttgtgacgcc 7500 
tatttacctc agaaggatga gtcgttctat actctaagct gggctcgtga ccatgttgat 7560 
ggctcaccac tgctggtggc agcaggaagc aatgggatca ttcgggtcat caattgtgct 7620 
acagaaaagt tagctaaggt aatctaccct tatatttgta tgtgttccta tggtaaactt 7680 
gaatgaagcc ttatttgcat aattcaatat ttcagttgtt tatttgacat atatcacttt 7740 
atttatgata tctgatccag aaggtctttt ggatttgctt tagttaagga atggtgcttg 7800 
ctacgcatta ataccataag caaactgtac cttttgctca cagaatattg ttaattttga 7860 
ctacttcagt atgtccgttg tagtaaaaac aaatcaactt ggtgtatcta ttttttcctt 7920 
gcttatacat agccaggaga ttgggcatgt ggcatgtcaa taaatactat cctataccat 7980 
ttgataggac acgcactgtg tcttatttgg tagctctgtt tacgtgattc tgcagagctt 8040 
tgttggccat ggcgactcaa taaatgtgat aagaactcaa ccgttgaagc cttcgctcat 8100 
catttctgca agcaaggtta tgcgatagtc tgttcttagg ttcatgtacc tttttatttt 8160 
tataatcttt ctgaattttg acaccatttc atatggcatt atctaatagg atgaatctgt 8220 
taggctatgg aatgtccata cagggatctg tatcttgata tttgctggag ctggaggtca 8280 
tcgcaatgaa gtattgagtg ttgtaagtag tgcctgctat tatgacattg tgcccttcaa 8340
aaaaaacatt attatgacat tatttttaga acattactag gttaaggtgc ctttaatatg 8400
gcgcactctt tcagctcctg atattaccac tcgttaccga gcgctacacc agagataaaa 8460
taaggctacc taatgactgc tactgctttt gtactttgat tacattagtc ataaatgtac 8520 
tgatgaatac attattttgt cttaaggact tccatcctag tgatattgaa cgttttgcaa 8580 
gttgtggcat ggacaacact gtgaaaatct ggtcaatgaa agcttagaaa gctacttcaa 8640
agttgcttca tatttgcatg ttgcgtgtca ttgagttcac caatgttgtc gcagaatttt 8700 
ggctatatgt tgacaaatca tattcatgga ctgaccttca tcaaagttcc acaaaatatg 8760 
gccagtttcc agtatgtttc acaatgccta tatccaatta tcctggcaag gtcctgttgg 8820 
tgtctaatcc tcatgccatc agactgacct gtttcttttt gtttcaggtc ttgattgctg 8880 
cagtacactc taactatgtt gattgaacaa gatggcttgg tgacttcatc ctatcaaagg 8940 
tgaaatttct gattcgttta aatggataca aatttctgta gcacggttgt cactcttttg 9000 
tgggtttgac atgccactgt cttggttcat ctattgctgt accgtgcaag tgttcagttt 9060 
tttcaatctt ttttctcagt gcttaatgag gggagattct atttgcagag tgttgtcaat 9120 
gaaattgtgc tttgggaacc gaagacaaaa gaacagagtc ctggggaggt aattcagttt 9180 
aactttccca gaattgtatt cctattataa tgccatatat ttacgcacag ttgtaaacta 9240 
tttccagatc cttagatttc aaggtactgg ctgccaatat taaatatgtt ccactgaagt 9300 
aatatgattt tctgttgcct catagggaag catcgatatc cttcagaagt atcctgtccc 9360 
agaatgtgac atttggttta tcaaattttc atgtgatttt cacttcaatc agttggcgat 9420 
aggtaatatc tctcatcagg attgtttctg gtagaagttt tatttaagat tttttttgct 9480
ctgtaaaatt tcacacacgc acacatgcac ccccacacac acacacatgc acgcacaccc 9540 
ccacccacct gcacgcgcgc gtacacacac accgcacaca tatatatgac tttttttccc 9600 
acacaaatat ttgctgtgtg agatatcagc aaataaattc gtatgtttga ttatattcag 9660 
agatatagga aaattgagtg ctctaatacc ccatccacta cttcaaacag gcaaccgtga 9720 
aggcaaaatc tacgtgtgga aaaatacagt ccagccctcc tgtcctcatt gctcggtagt 9780 
tttcactgga agagtttcag ttattcttgt ctcccacttg tatcgtcgca tgcttctgga 9840 
tgccaatgct tcatcatttt caggctgtat aatcagcagt gtaaatcgcc gataagacaa 9900 
actgcagtgt ccttcgatgg aaggtacctc actctaatcc atgctcaatt tggtgtactg 9960 
tctattctag cacttgcttt tttcttggtt ctgcttgaga aattctcgat tgcatgtcat 10020 
atgctggtgc attttctttt ttctgtttcc gtggcggatt ggtaaaatgc gacgatgcct 10080 
tccttatcta gcacaatcct tggagctggt gaagacggca ccatctggcg gtgggatgaa 10140 
gtggaccatc cgagctccag aaactgaaga agtgttgccg ctcaatgctg gactgatggt 10200 
tacgctcggt tggggttgtg atggttgaat ccgttggcgg aaagtgccac ctggtgtttt 10260 
tttctagtca aaatggttga tgttaacaga atattgaatg cttcgaatgt tgaaagttgg 10320 
gatgcttgtg ctggtactct gctccgcgga cgagtgaact tagtttgttg caactttggg 10380 
aaccgttgtc atctgtttgt tctgcatttc taaaaagaga gcaaatttca ggatacatgt 10440 
tctttttttt cagtacagga aaactaaggt tgaggtattg ctttgcaatt tactctctct 10500 
ctctctctct cttaaaaaaa ctggatcttg cttcaacgat gcattccttg ggtcatcggt 10560 
tttacttttg aaatcttgat agctgggcct aaagttacca agcccactag tatcagaagt 10620 
aataatatga tggctcctcc cctgccttac tgtcacgtgt aaactttcga aactagcagg 10680
actgtagcat ttagcgagct ggttgtttgg gttagagctc agcgtcgcaa cttatggtac 10740 
cgaggtcagt gtcaagatct atggcaccat ggttcaatca cagttttagt cccaccaaaa 10800 
atataaaggt gaagtttcga caaaaaatgg ctagaataaa aaaaaacagg tccacatact 10860 
gaggagaaca catgacagat tcaccaagga ttttgaattg aaagaggcta atgattgaca 10920 
ggatttgatc ttcaattcca cctcccgttg tcctgcttct actctaaagt tcaagcgtgg 10980 
ctcagtttgg ctatctgtta taatttcaag aaatcctgat ttctgttagc agtttactag 11040 
gctattagga ggagctggga caaaagaaaa acgagaattg acgaggacaa attcgcaatt 11100 
agttgggaaa ttgggggcac aattttcaat gcccacaaaa ttcactcccc ctacntntgc 11160 
ggnggaatgg ggtcanncct cantgtcccc tgttnccggg acaagtntaa ctaacacatt 11220 
tccnnattnn tn 11232 
(57) Abstract: Compositions and methods for identifying imprinting and genes regulated by imprinting are provided. The methods involve an analysis of the nucleotide sequence and the identification of CpG islands. At least two islands are involved in imprinting. Thus, genes can be identified that are differentially expressed based on parental inheritance. In this manner, the methods are useful for determining the propensity of a gene to be influenced by imprinting. Such analysis involves determining the pattern of imprinting for cells of interest. Fig. 2A shows the pattern of paternal and maternal ZmFie1 allele expression in developing kernels. It is further recognized that DNA constructs can be constructed which show differential expression depending upon the parent-of-origin. To silence a paternally inherited allele, at least two CpG islands are utilized in the construct.
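By way of illustration only, the sequence analysis referred to above (scanning a nucleotide sequence for CpG islands) might be sketched as follows. The thresholds used here (a 200-base window, GC content of at least 50%, and an observed/expected CpG ratio of at least 0.6) are commonly used criteria assumed for this sketch; they are not taken from this document, and the function names `cpg_stats` and `find_cpg_islands` are hypothetical.

```python
def cpg_stats(window):
    """Return (GC fraction, observed/expected CpG ratio) for one window."""
    seq = window.lower()
    n = len(seq)
    c = seq.count("c")
    g = seq.count("g")
    cpg = seq.count("cg")  # "cg" cannot overlap itself, so count() is exact
    gc_fraction = (c + g) / n if n else 0.0
    # Expected CpG count under independence is (c * g) / n.
    obs_exp = (cpg * n) / (c * g) if c and g else 0.0
    return gc_fraction, obs_exp


def find_cpg_islands(sequence, window=200, step=1, min_gc=0.5, min_obs_exp=0.6):
    """Slide a window along the sequence and merge qualifying windows.

    Whitespace (as used to group bases in a listing) is ignored.
    Returns a list of (start, end) pairs, 0-based, end-exclusive.
    The default thresholds are assumptions, not values from the source.
    """
    seq = "".join(sequence.split()).lower()
    islands = []
    for start in range(0, len(seq) - window + 1, step):
        gc, ratio = cpg_stats(seq[start:start + window])
        if gc >= min_gc and ratio >= min_obs_exp:
            end = start + window
            if islands and start <= islands[-1][1]:
                # Overlaps or touches the previous island: extend it.
                islands[-1] = (islands[-1][0], end)
            else:
                islands.append((start, end))
    return islands
```

Under these assumptions, a candidate sequence could be screened for the "at least two islands" condition with a check such as `len(find_cpg_islands(seq)) >= 2`.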